Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 45476 |
| Missing cells | 26218 |
| Missing cells (%) | 2.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.3 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 11 |
Genres has a high cardinality: 4068 distinct values | High cardinality |
OriginalLanguage has a high cardinality: 93 distinct values | High cardinality |
Overview has a high cardinality: 44234 distinct values | High cardinality |
ProductionCompanies has a high cardinality: 22667 distinct values | High cardinality |
ProductionCountries has a high cardinality: 2390 distinct values | High cardinality |
ReleaseDate has a high cardinality: 17334 distinct values | High cardinality |
Tagline has a high cardinality: 20269 distinct values | High cardinality |
Title has a high cardinality: 42197 distinct values | High cardinality |
Director has a high cardinality: 17573 distinct values | High cardinality |
MovieCharacter has a high cardinality: 40180 distinct values | High cardinality |
ActorName has a high cardinality: 42678 distinct values | High cardinality |
Budget is highly overall correlated with Revenue and 1 other fields | High correlation |
Popularity is highly overall correlated with VoteCount | High correlation |
Revenue is highly overall correlated with Budget and 2 other fields | High correlation |
VoteCount is highly overall correlated with Popularity and 1 other fields | High correlation |
Return is highly overall correlated with Budget and 1 other fields | High correlation |
OriginalLanguage is highly imbalanced (67.4%) | Imbalance |
ProductionCountries is highly imbalanced (58.3%) | Imbalance |
Tagline has 25078 (55.1%) missing values | Missing |
Popularity is highly skewed (γ1 = 29.21506573) | Skewed |
Return is highly skewed (γ1 = 138.3340992) | Skewed |
Tagline is uniformly distributed | Uniform |
Title is uniformly distributed | Uniform |
Budget has 36490 (80.2%) zeros | Zeros |
Revenue has 37972 (83.5%) zeros | Zeros |
Runtime has 1535 (3.4%) zeros | Zeros |
VoteAverage has 2947 (6.5%) zeros | Zeros |
VoteCount has 2849 (6.3%) zeros | Zeros |
Return has 39998 (88.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-06-13 15:11:30.412539 |
|---|---|
| Analysis finished | 2023-06-13 15:12:16.736040 |
| Duration | 46.32 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
Budget
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1223 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 100 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4232604.4 |
| Minimum | 0 |
|---|---|
| Maximum | 3.8 × 108 |
| Zeros | 36490 |
| Zeros (%) | 80.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 25000000 |
| Maximum | 3.8 × 108 |
| Range | 3.8 × 108 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17439860 |
|---|---|
| Coefficient of variation (CV) | 4.1203614 |
| Kurtosis | 66.634491 |
| Mean | 4232604.4 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.1183385 |
| Sum | 1.9205866 × 1011 |
| Variance | 3.041487 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36490 | |
| 5000000 | 286 | 0.6% |
| 10000000 | 259 | 0.6% |
| 20000000 | 243 | 0.5% |
| 2000000 | 242 | 0.5% |
| 15000000 | 226 | 0.5% |
| 3000000 | 223 | 0.5% |
| 25000000 | 206 | 0.5% |
| 1000000 | 197 | 0.4% |
| 30000000 | 190 | 0.4% |
| Other values (1213) | 6814 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 36490 | |
| 1 | 25 | 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 8 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 380000000 | 1 | < 0.1% |
| 300000000 | 1 | < 0.1% |
| 280000000 | 1 | < 0.1% |
| 270000000 | 1 | < 0.1% |
| 260000000 | 3 | < 0.1% |
| 258000000 | 1 | < 0.1% |
| 255000000 | 1 | < 0.1% |
| 250000000 | 10 | |
| 245000000 | 2 | < 0.1% |
| 237000000 | 1 | < 0.1% |
Genres
Categorical
| Distinct | 4068 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| Drama | |
|---|---|
| Comedy | |
| Documentary | 2713 |
| NoGenre | 2481 |
| Drama, Romance | 1301 |
| Other values (4063) |
Length
| Max length | 84 |
|---|---|
| Median length | 68 |
| Mean length | 15.950567 |
| Min length | 3 |
Characters and Unicode
| Total characters | 725368 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2367 ? |
|---|---|
| Unique (%) | 5.2% |
Sample
| 1st row | Animation, Comedy, Family |
|---|---|
| 2nd row | Adventure, Fantasy, Family |
| 3rd row | Romance, Comedy |
| 4th row | Comedy, Drama, Romance |
| 5th row | Comedy |
Common Values
| Value | Count | Frequency (%) |
| Drama | 4998 | 11.0% |
| Comedy | 3621 | 8.0% |
| Documentary | 2713 | 6.0% |
| NoGenre | 2481 | 5.5% |
| Drama, Romance | 1301 | 2.9% |
| Comedy, Drama | 1135 | 2.5% |
| Horror | 974 | 2.1% |
| Comedy, Romance | 930 | 2.0% |
| Comedy, Drama, Romance | 593 | 1.3% |
| Drama, Comedy | 532 | 1.2% |
| Other values (4058) | 26198 |
Length
| Value | Count | Frequency (%) |
| drama | 20255 | |
| comedy | 13181 | |
| thriller | 7619 | 7.8% |
| romance | 6733 | 6.9% |
| action | 6592 | 6.8% |
| horror | 4670 | 4.8% |
| crime | 4305 | 4.4% |
| documentary | 3921 | 4.0% |
| adventure | 3494 | 3.6% |
| science | 3042 | 3.1% |
| Other values (37) | 23540 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 71563 | 9.9% |
| a | 61822 | 8.5% |
| e | 60748 | 8.4% |
| m | 53101 | 7.3% |
| 51876 | 7.2% | |
| o | 51022 | 7.0% |
| , | 48053 | 6.6% |
| i | 39670 | 5.5% |
| n | 38157 | 5.3% |
| y | 28510 | 3.9% |
| Other values (31) | 220846 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 524833 | |
| Uppercase Letter | 100606 | 13.9% |
| Space Separator | 51876 | 7.2% |
| Other Punctuation | 48053 | 6.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 71563 | |
| a | 61822 | |
| e | 60748 | |
| m | 53101 | |
| o | 51022 | |
| i | 39670 | |
| n | 38157 | |
| y | 28510 | 5.4% |
| c | 27977 | 5.3% |
| t | 26210 | 5.0% |
| Other values (12) | 66053 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 24176 | |
| C | 17489 | |
| A | 12020 | |
| F | 9746 | |
| T | 8389 | 8.3% |
| R | 6735 | 6.7% |
| H | 6068 | 6.0% |
| M | 4830 | 4.8% |
| S | 3046 | 3.0% |
| G | 2483 | 2.5% |
| Other values (7) | 5624 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| 51876 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 48053 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 625439 | |
| Common | 99929 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 71563 | |
| a | 61822 | 9.9% |
| e | 60748 | 9.7% |
| m | 53101 | 8.5% |
| o | 51022 | 8.2% |
| i | 39670 | 6.3% |
| n | 38157 | 6.1% |
| y | 28510 | 4.6% |
| c | 27977 | 4.5% |
| t | 26210 | 4.2% |
| Other values (29) | 166659 |
Common
| Value | Count | Frequency (%) |
| 51876 | ||
| , | 48053 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 725368 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 71563 | 9.9% |
| a | 61822 | 8.5% |
| e | 60748 | 8.4% |
| m | 53101 | 7.3% |
| 51876 | 7.2% | |
| o | 51022 | 7.0% |
| , | 48053 | 6.6% |
| i | 39670 | 5.5% |
| n | 38157 | 5.3% |
| y | 28510 | 3.9% |
| Other values (31) | 220846 |
OriginalLanguage
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 93 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| en | |
|---|---|
| fr | 2437 |
| it | 1528 |
| ja | 1349 |
| de | 1078 |
| Other values (88) |
Length
| Max length | 10 |
|---|---|
| Median length | 2 |
| Mean length | 2.019153 |
| Min length | 2 |
Characters and Unicode
| Total characters | 91823 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
Common Values
| Value | Count | Frequency (%) |
| en | 32202 | |
| fr | 2437 | 5.4% |
| it | 1528 | 3.4% |
| ja | 1349 | 3.0% |
| de | 1078 | 2.4% |
| es | 992 | 2.2% |
| ru | 822 | 1.8% |
| hi | 508 | 1.1% |
| ko | 444 | 1.0% |
| zh | 408 | 0.9% |
| Other values (83) | 3708 | 8.2% |
Length
| Value | Count | Frequency (%) |
| en | 32202 | |
| fr | 2437 | 5.4% |
| it | 1528 | 3.4% |
| ja | 1349 | 3.0% |
| de | 1078 | 2.4% |
| es | 992 | 2.2% |
| ru | 822 | 1.8% |
| hi | 508 | 1.1% |
| ko | 444 | 1.0% |
| zh | 408 | 0.9% |
| Other values (83) | 3708 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 34635 | |
| n | 33018 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 2055 | 2.2% |
| s | 1652 | 1.8% |
| j | 1350 | 1.5% |
| d | 1323 | 1.4% |
| Other values (25) | 6687 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 91594 | |
| Uppercase Letter | 216 | 0.2% |
| Decimal Number | 10 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 34635 | |
| n | 33018 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 2055 | 2.2% |
| s | 1652 | 1.8% |
| j | 1350 | 1.5% |
| d | 1323 | 1.4% |
| Other values (16) | 6458 | 7.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 8 | 2 | |
| 2 | 1 | 10.0% |
| 6 | 1 | 10.0% |
| 1 | 1 | 10.0% |
| 4 | 1 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 108 | |
| L | 108 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 91810 | |
| Common | 13 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 34635 | |
| n | 33018 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 2055 | 2.2% |
| s | 1652 | 1.8% |
| j | 1350 | 1.5% |
| d | 1323 | 1.4% |
| Other values (18) | 6674 | 7.3% |
Common
| Value | Count | Frequency (%) |
| 0 | 4 | |
| . | 3 | |
| 8 | 2 | |
| 2 | 1 | 7.7% |
| 6 | 1 | 7.7% |
| 1 | 1 | 7.7% |
| 4 | 1 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 91823 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 34635 | |
| n | 33018 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 2055 | 2.2% |
| s | 1652 | 1.8% |
| j | 1350 | 1.5% |
| d | 1323 | 1.4% |
| Other values (25) | 6687 | 7.3% |
Overview
Categorical
| Distinct | 44234 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| NoOverview | 1038 |
|---|---|
| No overview found. | 133 |
| No Overview | 7 |
| 5 | |
| Released | 3 |
| Other values (44229) |
Length
| Max length | 1000 |
|---|---|
| Median length | 791 |
| Mean length | 316.12519 |
| Min length | 1 |
Characters and Unicode
| Total characters | 14376109 |
|---|---|
| Distinct characters | 429 |
| Distinct categories | 25 ? |
| Distinct scripts | 13 ? |
| Distinct blocks | 21 ? |
Unique
| Unique | 44173 ? |
|---|---|
| Unique (%) | 97.1% |
Sample
| 1st row | Led by Woody, Andy's toys live happily in his room until Andy's birthday brings Buzz Lightyear onto the scene. Afraid of losing his place in Andy's heart, Woody plots against Buzz. But when circumstances separate Buzz and Woody from their owner, the duo eventually learns to put aside their differences. |
|---|---|
| 2nd row | When siblings Judy and Peter discover an enchanted board game that opens the door to a magical world, they unwittingly invite Alan -- an adult who's been trapped inside the game for 26 years -- into their living room. Alan's only hope for freedom is to finish the game, which proves risky as all three find themselves running from giant rhinoceroses, evil monkeys and other terrifying creatures. |
| 3rd row | A family wedding reignites the ancient feud between next-door neighbors and fishing buddies John and Max. Meanwhile, a sultry Italian divorcée opens a restaurant at the local bait shop, alarming the locals who worry she'll scare the fish away. But she's less interested in seafood than she is in cooking up a hot time with Max. |
| 4th row | Cheated on, mistreated and stepped on, the women are holding their breath, waiting for the elusive "good man" to break a string of less-than-stellar lovers. Friends and confidants Vannah, Bernie, Glo and Robin talk it all out, determined to find a better way to breathe. |
| 5th row | Just when George Banks has recovered from his daughter's wedding, he receives the news that she's pregnant ... and that George's wife, Nina, is expecting too. He was planning on selling their home, but that's a plan that -- like George -- will have to change with the arrival of both a grandchild and a kid of his own. |
Common Values
| Value | Count | Frequency (%) |
| NoOverview | 1038 | 2.3% |
| No overview found. | 133 | 0.3% |
| No Overview | 7 | < 0.1% |
| 5 | < 0.1% | |
| Released | 3 | < 0.1% |
| Recovering from a nail gun shot to the head and 13 months of coma, doctor Pekka Valinta starts to unravel the mystery of his past, still suffering from total amnesia. | 3 | < 0.1% |
| King Lear, old and tired, divides his kingdom among his daughters, giving great importance to their protestations of love for him. When Cordelia, youngest and most honest, refuses to idly flatter the old man in return for favor, he banishes her and turns for support to his remaining daughters. But Goneril and Regan have no love for him and instead plot to take all his power from him. In a parallel, Lear's loyal courtier Gloucester favors his illegitimate son Edmund after being told lies about his faithful son Edgar. Madness and tragedy befall both ill-starred fathers. | 3 | < 0.1% |
| No movie overview available. | 3 | < 0.1% |
| Adaptation of the Jane Austen novel. | 3 | < 0.1% |
| A few funny little novels about different aspects of life. | 3 | < 0.1% |
| Other values (44224) | 44275 |
Length
| Value | Count | Frequency (%) |
| the | 138082 | 5.6% |
| a | 98889 | 4.0% |
| and | 75259 | 3.1% |
| to | 73321 | 3.0% |
| of | 69574 | 2.8% |
| in | 48143 | 2.0% |
| is | 36500 | 1.5% |
| his | 36165 | 1.5% |
| with | 23902 | 1.0% |
| her | 21484 | 0.9% |
| Other values (97092) | 1828430 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2406350 | ||
| e | 1365872 | 9.5% |
| a | 940505 | 6.5% |
| t | 934766 | 6.5% |
| i | 852552 | 5.9% |
| o | 830911 | 5.8% |
| n | 822601 | 5.7% |
| s | 767854 | 5.3% |
| r | 745312 | 5.2% |
| h | 600810 | 4.2% |
| Other values (419) | 4108576 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11158386 | |
| Space Separator | 2406388 | 16.7% |
| Uppercase Letter | 393041 | 2.7% |
| Other Punctuation | 312824 | 2.2% |
| Decimal Number | 42223 | 0.3% |
| Dash Punctuation | 36767 | 0.3% |
| Close Punctuation | 10100 | 0.1% |
| Open Punctuation | 10077 | 0.1% |
| Final Punctuation | 4556 | < 0.1% |
| Initial Punctuation | 882 | < 0.1% |
| Other values (15) | 865 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1365872 | |
| a | 940505 | 8.4% |
| t | 934766 | 8.4% |
| i | 852552 | 7.6% |
| o | 830911 | 7.4% |
| n | 822601 | 7.4% |
| s | 767854 | 6.9% |
| r | 745312 | 6.7% |
| h | 600810 | 5.4% |
| l | 478816 | 4.3% |
| Other values (142) | 2818387 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 42751 | 10.9% |
| T | 35968 | 9.2% |
| S | 31126 | 7.9% |
| M | 23954 | 6.1% |
| B | 23699 | 6.0% |
| C | 22803 | 5.8% |
| H | 19429 | 4.9% |
| W | 18652 | 4.7% |
| I | 16798 | 4.3% |
| D | 16311 | 4.1% |
| Other values (77) | 141550 |
Other Letter
| Value | Count | Frequency (%) |
| न | 6 | 4.8% |
| र | 6 | 4.8% |
| म | 5 | 4.0% |
| の | 4 | 3.2% |
| द | 3 | 2.4% |
| प | 3 | 2.4% |
| ద | 3 | 2.4% |
| अ | 3 | 2.4% |
| व | 2 | 1.6% |
| م | 2 | 1.6% |
| Other values (76) | 88 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 133443 | |
| . | 124794 | |
| ' | 31121 | 9.9% |
| " | 11661 | 3.7% |
| : | 3299 | 1.1% |
| ? | 2759 | 0.9% |
| ; | 2493 | 0.8% |
| ! | 1543 | 0.5% |
| / | 765 | 0.2% |
| & | 453 | 0.1% |
| Other values (12) | 493 | 0.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ి | 4 | |
| ் | 3 | |
| ్ | 3 | |
| ् | 3 | |
| ̈ | 3 | |
| ా | 2 | 6.1% |
| े | 2 | 6.1% |
| ं | 2 | 6.1% |
| ु | 2 | 6.1% |
| Other values (4) | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9748 | |
| 0 | 8265 | |
| 9 | 6405 | |
| 2 | 4251 | |
| 5 | 2440 | 5.8% |
| 8 | 2379 | 5.6% |
| 3 | 2342 | 5.5% |
| 4 | 2176 | 5.2% |
| 7 | 2131 | 5.0% |
| 6 | 2086 | 4.9% |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 11 | |
| ी | 4 | 14.8% |
| ो | 3 | 11.1% |
| ు | 3 | 11.1% |
| ि | 2 | 7.4% |
| ு | 2 | 7.4% |
| ం | 1 | 3.7% |
| ி | 1 | 3.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 35244 | |
| – | 881 | 2.4% |
| — | 633 | 1.7% |
| ― | 5 | < 0.1% |
| ‐ | 4 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 45 | |
| ™ | 14 | 21.9% |
| ¦ | 2 | 3.1% |
| ° | 2 | 3.1% |
| � | 1 | 1.6% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 20 | |
| + | 11 | |
| = | 6 | 15.0% |
| | | 2 | 5.0% |
| − | 1 | 2.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10024 | |
| [ | 50 | 0.5% |
| { | 2 | < 0.1% |
| „ | 1 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 317 | |
| £ | 10 | 3.0% |
| ₹ | 1 | 0.3% |
| € | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2406350 | ||
| 36 | < 0.1% | |
| 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10048 | |
| ] | 50 | 0.5% |
| } | 2 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3847 | |
| ” | 690 | 15.1% |
| » | 19 | 0.4% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 672 | |
| ‘ | 192 | 21.8% |
| « | 18 | 2.0% |
Control
| Value | Count | Frequency (%) |
| 106 | ||
| | 3 | 2.7% |
| | 1 | 0.9% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 25 | |
| ` | 12 | |
| ¯ | 1 | 2.6% |
Format
| Value | Count | Frequency (%) |
| | 31 | |
| | 20 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 8 | |
| ¹ | 8 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19 |
Line Separator
| Value | Count | Frequency (%) |
| 7 |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 2 |
Paragraph Separator
| Value | Count | Frequency (%) |
| 2 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11546195 | |
| Common | 2824495 | 19.6% |
| Cyrillic | 4587 | < 0.1% |
| Greek | 648 | < 0.1% |
| Devanagari | 77 | < 0.1% |
| Telugu | 30 | < 0.1% |
| Hiragana | 20 | < 0.1% |
| Tamil | 19 | < 0.1% |
| Han | 10 | < 0.1% |
| Hangul | 9 | < 0.1% |
| Other values (3) | 19 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1365872 | |
| a | 940505 | 8.1% |
| t | 934766 | 8.1% |
| i | 852552 | 7.4% |
| o | 830911 | 7.2% |
| n | 822601 | 7.1% |
| s | 767854 | 6.7% |
| r | 745312 | 6.5% |
| h | 600810 | 5.2% |
| l | 478816 | 4.1% |
| Other values (132) | 3206196 |
Common
| Value | Count | Frequency (%) |
| 2406350 | ||
| , | 133443 | 4.7% |
| . | 124794 | 4.4% |
| - | 35244 | 1.2% |
| ' | 31121 | 1.1% |
| " | 11661 | 0.4% |
| ) | 10048 | 0.4% |
| ( | 10024 | 0.4% |
| 1 | 9748 | 0.3% |
| 0 | 8265 | 0.3% |
| Other values (71) | 43797 | 1.6% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 470 | 10.2% |
| е | 404 | 8.8% |
| а | 373 | 8.1% |
| н | 323 | 7.0% |
| и | 299 | 6.5% |
| т | 265 | 5.8% |
| р | 240 | 5.2% |
| с | 218 | 4.8% |
| в | 173 | 3.8% |
| л | 161 | 3.5% |
| Other values (46) | 1661 |
Greek
| Value | Count | Frequency (%) |
| α | 60 | 9.3% |
| ο | 55 | 8.5% |
| τ | 43 | 6.6% |
| ι | 36 | 5.6% |
| η | 36 | 5.6% |
| ν | 34 | 5.2% |
| ε | 31 | 4.8% |
| ρ | 31 | 4.8% |
| π | 30 | 4.6% |
| ς | 30 | 4.6% |
| Other values (33) | 262 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 11 | 14.3% |
| न | 6 | 7.8% |
| र | 6 | 7.8% |
| म | 5 | 6.5% |
| ी | 4 | 5.2% |
| द | 3 | 3.9% |
| ो | 3 | 3.9% |
| ् | 3 | 3.9% |
| प | 3 | 3.9% |
| अ | 3 | 3.9% |
| Other values (21) | 30 |
Hiragana
| Value | Count | Frequency (%) |
| の | 4 | |
| さ | 1 | 5.0% |
| ん | 1 | 5.0% |
| と | 1 | 5.0% |
| そ | 1 | 5.0% |
| め | 1 | 5.0% |
| ひ | 1 | 5.0% |
| ち | 1 | 5.0% |
| ず | 1 | 5.0% |
| か | 1 | 5.0% |
| Other values (7) | 7 |
Telugu
| Value | Count | Frequency (%) |
| ి | 4 | |
| ్ | 3 | |
| ు | 3 | |
| ద | 3 | |
| ా | 2 | 6.7% |
| న | 2 | 6.7% |
| స | 2 | 6.7% |
| మ | 2 | 6.7% |
| ర | 2 | 6.7% |
| బ | 1 | 3.3% |
| Other values (6) | 6 |
Tamil
| Value | Count | Frequency (%) |
| ் | 3 | |
| ம | 2 | |
| ர | 2 | |
| ு | 2 | |
| ப | 2 | |
| ன | 1 | 5.3% |
| வ | 1 | 5.3% |
| த | 1 | 5.3% |
| ஆ | 1 | 5.3% |
| ய | 1 | 5.3% |
| Other values (3) | 3 |
Han
| Value | Count | Frequency (%) |
| 俣 | 1 | |
| 界 | 1 | |
| 患 | 1 | |
| 者 | 1 | |
| 世 | 1 | |
| 水 | 1 | |
| 鬼 | 1 | |
| 見 | 1 | |
| 難 | 1 | |
| 海 | 1 |
Hangul
| Value | Count | Frequency (%) |
| 사 | 2 | |
| 회 | 1 | |
| 식 | 1 | |
| 주 | 1 | |
| 기 | 1 | |
| 찾 | 1 | |
| 랑 | 1 | |
| 첫 | 1 |
Thai
| Value | Count | Frequency (%) |
| ่ | 2 | |
| ง | 1 | |
| ร | 1 | |
| พ | 1 | |
| แ | 1 | |
| ี | 1 | |
| ส | 1 |
Arabic
| Value | Count | Frequency (%) |
| م | 2 | |
| ہ | 1 | |
| ت | 1 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ̈ | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14358111 | |
| Punctuation | 7270 | 0.1% |
| None | 5930 | < 0.1% |
| Cyrillic | 4587 | < 0.1% |
| Devanagari | 77 | < 0.1% |
| Telugu | 30 | < 0.1% |
| Hiragana | 20 | < 0.1% |
| Tamil | 19 | < 0.1% |
| Letterlike Symbols | 14 | < 0.1% |
| CJK | 10 | < 0.1% |
| Other values (11) | 41 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2406350 | ||
| e | 1365872 | 9.5% |
| a | 940505 | 6.6% |
| t | 934766 | 6.5% |
| i | 852552 | 5.9% |
| o | 830911 | 5.8% |
| n | 822601 | 5.7% |
| s | 767854 | 5.3% |
| r | 745312 | 5.2% |
| h | 600810 | 4.2% |
| Other values (82) | 4090578 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3847 | |
| – | 881 | 12.1% |
| ” | 690 | 9.5% |
| “ | 672 | 9.2% |
| — | 633 | 8.7% |
| … | 303 | 4.2% |
| ‘ | 192 | 2.6% |
| | 31 | 0.4% |
| 7 | 0.1% | |
| ― | 5 | 0.1% |
| Other values (4) | 9 | 0.1% |
None
| Value | Count | Frequency (%) |
| é | 1552 | |
| ä | 294 | 5.0% |
| á | 293 | 4.9% |
| ö | 250 | 4.2% |
| í | 243 | 4.1% |
| è | 209 | 3.5% |
| ü | 178 | 3.0% |
| ı | 165 | 2.8% |
| ó | 164 | 2.8% |
| ç | 158 | 2.7% |
| Other values (141) | 2424 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 470 | 10.2% |
| е | 404 | 8.8% |
| а | 373 | 8.1% |
| н | 323 | 7.0% |
| и | 299 | 6.5% |
| т | 265 | 5.8% |
| р | 240 | 5.2% |
| с | 218 | 4.8% |
| в | 173 | 3.8% |
| л | 161 | 3.5% |
| Other values (46) | 1661 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 14 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 11 | 14.3% |
| न | 6 | 7.8% |
| र | 6 | 7.8% |
| म | 5 | 6.5% |
| ी | 4 | 5.2% |
| द | 3 | 3.9% |
| ो | 3 | 3.9% |
| ् | 3 | 3.9% |
| प | 3 | 3.9% |
| अ | 3 | 3.9% |
| Other values (21) | 30 |
Alphabetic PF
| Value | Count | Frequency (%) |
| fi | 4 |
Hiragana
| Value | Count | Frequency (%) |
| の | 4 | |
| さ | 1 | 5.0% |
| ん | 1 | 5.0% |
| と | 1 | 5.0% |
| そ | 1 | 5.0% |
| め | 1 | 5.0% |
| ひ | 1 | 5.0% |
| ち | 1 | 5.0% |
| ず | 1 | 5.0% |
| か | 1 | 5.0% |
| Other values (7) | 7 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ̈ | 3 |
Telugu
| Value | Count | Frequency (%) |
| ి | 4 | |
| ్ | 3 | |
| ు | 3 | |
| ద | 3 | |
| ా | 2 | 6.7% |
| న | 2 | 6.7% |
| స | 2 | 6.7% |
| మ | 2 | 6.7% |
| ర | 2 | 6.7% |
| బ | 1 | 3.3% |
| Other values (6) | 6 |
Tamil
| Value | Count | Frequency (%) |
| ் | 3 | |
| ம | 2 | |
| ர | 2 | |
| ு | 2 | |
| ப | 2 | |
| ன | 1 | 5.3% |
| வ | 1 | 5.3% |
| த | 1 | 5.3% |
| ஆ | 1 | 5.3% |
| ய | 1 | 5.3% |
| Other values (3) | 3 |
Arabic
| Value | Count | Frequency (%) |
| م | 2 | |
| ہ | 1 | |
| ت | 1 |
Hangul
| Value | Count | Frequency (%) |
| 사 | 2 | |
| 회 | 1 | |
| 식 | 1 | |
| 주 | 1 | |
| 기 | 1 | |
| 찾 | 1 | |
| 랑 | 1 | |
| 첫 | 1 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 2 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 2 |
Thai
| Value | Count | Frequency (%) |
| ่ | 2 | |
| ง | 1 | |
| ร | 1 | |
| พ | 1 | |
| แ | 1 | |
| ี | 1 | |
| ส | 1 |
CJK
| Value | Count | Frequency (%) |
| 俣 | 1 | |
| 界 | 1 | |
| 患 | 1 | |
| 者 | 1 | |
| 世 | 1 | |
| 水 | 1 | |
| 鬼 | 1 | |
| 見 | 1 | |
| 難 | 1 | |
| 海 | 1 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
Katakana
| Value | Count | Frequency (%) |
| ・ | 1 |
Currency Symbols
| Value | Count | Frequency (%) |
| ₹ | 1 | |
| € | 1 |
Specials
| Value | Count | Frequency (%) |
| � | 1 |
Popularity
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 43731 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 100 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9264576 |
| Minimum | 0 |
|---|---|
| Maximum | 547.4883 |
| Zeros | 40 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.02079775 |
| Q1 | 0.3888395 |
| median | 1.1304545 |
| Q3 | 3.6916945 |
| 95-th percentile | 11.063627 |
| Maximum | 547.4883 |
| Range | 547.4883 |
| Interquartile range (IQR) | 3.302855 |
Descriptive statistics
| Standard deviation | 6.0096718 |
|---|---|
| Coefficient of variation (CV) | 2.0535653 |
| Kurtosis | 1923.6882 |
| Mean | 2.9264576 |
| Median Absolute Deviation (MAD) | 0.9676215 |
| Skewness | 29.215066 |
| Sum | 132790.94 |
| Variance | 36.116156 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 × 10-6 | 56 | 0.1% |
| 0.000308 | 42 | 0.1% |
| 0 | 40 | 0.1% |
| 0.00022 | 39 | 0.1% |
| 0.000844 | 38 | 0.1% |
| 0.001177 | 38 | 0.1% |
| 0.000578 | 38 | 0.1% |
| 0.002001 | 27 | 0.1% |
| 0.003013 | 21 | < 0.1% |
| 0.00353 | 19 | < 0.1% |
| Other values (43721) | 45018 | |
| (Missing) | 100 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 40 | |
| 1 × 10-6 | 56 | |
| 2 × 10-6 | 6 | < 0.1% |
| 3 × 10-6 | 6 | < 0.1% |
| 4 × 10-6 | 5 | < 0.1% |
| 5 × 10-6 | 1 | < 0.1% |
| 6 × 10-6 | 2 | < 0.1% |
| 7 × 10-6 | 1 | < 0.1% |
| 8 × 10-6 | 6 | < 0.1% |
| 9 × 10-6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 547.488298 | 1 | |
| 294.337037 | 1 | |
| 287.253654 | 1 | |
| 228.032744 | 1 | |
| 213.849907 | 1 | |
| 187.860492 | 1 | |
| 185.330992 | 1 | |
| 185.070892 | 1 | |
| 183.870374 | 1 | |
| 154.801009 | 1 |
ProductionCompanies
Categorical
| Distinct | 22667 |
|---|---|
| Distinct (%) | 49.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| MissingValue | |
|---|---|
| Metro-Goldwyn-Mayer (MGM) | 742 |
| Warner Bros. | 540 |
| Paramount Pictures | 505 |
| Twentieth Century Fox Film Corporation | 439 |
| Other values (22662) |
Length
| Max length | 609 |
|---|---|
| Median length | 476 |
| Mean length | 33.778894 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1536129 |
|---|---|
| Distinct characters | 294 |
| Distinct categories | 17 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 20300 ? |
|---|---|
| Unique (%) | 44.6% |
Sample
| 1st row | Pixar Animation Studios |
|---|---|
| 2nd row | TriStar Pictures, Teitler Film, Interscope Communications |
| 3rd row | Warner Bros., Lancaster Gate |
| 4th row | Twentieth Century Fox Film Corporation |
| 5th row | Sandollar Productions, Touchstone Pictures |
Common Values
| Value | Count | Frequency (%) |
| MissingValue | 11896 | 26.2% |
| Metro-Goldwyn-Mayer (MGM) | 742 | 1.6% |
| Warner Bros. | 540 | 1.2% |
| Paramount Pictures | 505 | 1.1% |
| Twentieth Century Fox Film Corporation | 439 | 1.0% |
| Universal Pictures | 320 | 0.7% |
| RKO Radio Pictures | 247 | 0.5% |
| Columbia Pictures Corporation | 207 | 0.5% |
| Columbia Pictures | 146 | 0.3% |
| Mosfilm | 145 | 0.3% |
| Other values (22657) | 30289 |
Length
| Value | Count | Frequency (%) |
| missingvalue | 11896 | 6.3% |
| films | 9455 | 5.0% |
| pictures | 9267 | 4.9% |
| productions | 9059 | 4.8% |
| film | 6679 | 3.5% |
| entertainment | 5154 | 2.7% |
| corporation | 2189 | 1.2% |
| company | 1769 | 0.9% |
| warner | 1478 | 0.8% |
| bros | 1411 | 0.7% |
| Other values (18617) | 131220 |
Most occurring characters
| Value | Count | Frequency (%) |
| 144110 | 9.4% | |
| i | 130730 | 8.5% |
| e | 106540 | 6.9% |
| n | 101865 | 6.6% |
| a | 89039 | 5.8% |
| s | 86459 | 5.6% |
| o | 85292 | 5.6% |
| r | 83547 | 5.4% |
| t | 83433 | 5.4% |
| l | 63160 | 4.1% |
| Other values (284) | 561954 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1105978 | |
| Uppercase Letter | 222757 | 14.5% |
| Space Separator | 144115 | 9.4% |
| Other Punctuation | 45099 | 2.9% |
| Decimal Number | 4347 | 0.3% |
| Dash Punctuation | 4331 | 0.3% |
| Open Punctuation | 4328 | 0.3% |
| Close Punctuation | 4327 | 0.3% |
| Math Symbol | 662 | < 0.1% |
| Other Letter | 140 | < 0.1% |
| Other values (7) | 45 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 130730 | |
| e | 106540 | |
| n | 101865 | |
| a | 89039 | |
| s | 86459 | 7.8% |
| o | 85292 | 7.7% |
| r | 83547 | 7.6% |
| t | 83433 | 7.5% |
| l | 63160 | 5.7% |
| u | 55647 | 5.0% |
| Other values (102) | 220266 |
Other Letter
| Value | Count | Frequency (%) |
| 스 | 9 | 6.4% |
| 트 | 8 | 5.7% |
| 인 | 6 | 4.3% |
| 엔 | 5 | 3.6% |
| 주 | 5 | 3.6% |
| 터 | 5 | 3.6% |
| 먼 | 5 | 3.6% |
| 테 | 5 | 3.6% |
| 픽 | 4 | 2.9% |
| 로 | 3 | 2.1% |
| Other values (62) | 85 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 27880 | |
| F | 26362 | |
| M | 25257 | |
| C | 20585 | 9.2% |
| V | 14957 | 6.7% |
| S | 11911 | 5.3% |
| E | 9746 | 4.4% |
| A | 9547 | 4.3% |
| T | 9356 | 4.2% |
| B | 9001 | 4.0% |
| Other values (52) | 58155 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 37354 | |
| . | 5671 | 12.6% |
| & | 764 | 1.7% |
| / | 645 | 1.4% |
| ' | 451 | 1.0% |
| " | 133 | 0.3% |
| ! | 36 | 0.1% |
| % | 18 | < 0.1% |
| : | 9 | < 0.1% |
| @ | 5 | < 0.1% |
| Other values (6) | 13 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1034 | |
| 1 | 712 | |
| 0 | 641 | |
| 3 | 556 | |
| 4 | 481 | |
| 9 | 205 | 4.7% |
| 6 | 195 | 4.5% |
| 5 | 178 | 4.1% |
| 8 | 173 | 4.0% |
| 7 | 172 | 4.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4318 | |
| [ | 9 | 0.2% |
| ( | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4317 | |
| ] | 9 | 0.2% |
| ) | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 144110 | ||
| 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4329 | |
| – | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 661 | |
| | | 1 | 0.2% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 23 | |
| ㈜ | 2 | 8.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 | |
| » | 3 |
Other Number
| Value | Count | Frequency (%) |
| ² | 1 | |
| ½ | 1 |
Control
| Value | Count | Frequency (%) |
| 4 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 3 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1328332 | |
| Common | 207252 | 13.5% |
| Cyrillic | 373 | < 0.1% |
| Hangul | 115 | < 0.1% |
| Greek | 31 | < 0.1% |
| Han | 26 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 130730 | 9.8% |
| e | 106540 | 8.0% |
| n | 101865 | 7.7% |
| a | 89039 | 6.7% |
| s | 86459 | 6.5% |
| o | 85292 | 6.4% |
| r | 83547 | 6.3% |
| t | 83433 | 6.3% |
| l | 63160 | 4.8% |
| u | 55647 | 4.2% |
| Other values (99) | 442620 |
Hangul
| Value | Count | Frequency (%) |
| 스 | 9 | 7.8% |
| 트 | 8 | 7.0% |
| 인 | 6 | 5.2% |
| 엔 | 5 | 4.3% |
| 주 | 5 | 4.3% |
| 터 | 5 | 4.3% |
| 먼 | 5 | 4.3% |
| 테 | 5 | 4.3% |
| 픽 | 4 | 3.5% |
| 로 | 3 | 2.6% |
| Other values (43) | 60 |
Common
| Value | Count | Frequency (%) |
| 144110 | ||
| , | 37354 | 18.0% |
| . | 5671 | 2.7% |
| - | 4329 | 2.1% |
| ( | 4318 | 2.1% |
| ) | 4317 | 2.1% |
| 2 | 1034 | 0.5% |
| & | 764 | 0.4% |
| 1 | 712 | 0.3% |
| + | 661 | 0.3% |
| Other values (37) | 3982 | 1.9% |
Cyrillic
| Value | Count | Frequency (%) |
| и | 34 | 9.1% |
| о | 28 | 7.5% |
| а | 26 | 7.0% |
| л | 22 | 5.9% |
| н | 20 | 5.4% |
| м | 19 | 5.1% |
| т | 17 | 4.6% |
| с | 16 | 4.3% |
| е | 16 | 4.3% |
| ь | 16 | 4.3% |
| Other values (36) | 159 |
Greek
| Value | Count | Frequency (%) |
| ο | 3 | 9.7% |
| ν | 3 | 9.7% |
| Ε | 2 | 6.5% |
| λ | 2 | 6.5% |
| η | 2 | 6.5% |
| ι | 2 | 6.5% |
| τ | 2 | 6.5% |
| ρ | 2 | 6.5% |
| Κ | 2 | 6.5% |
| έ | 1 | 3.2% |
| Other values (10) | 10 |
Han
| Value | Count | Frequency (%) |
| 北 | 2 | 7.7% |
| 京 | 2 | 7.7% |
| 司 | 2 | 7.7% |
| 公 | 2 | 7.7% |
| 限 | 2 | 7.7% |
| 有 | 2 | 7.7% |
| 影 | 2 | 7.7% |
| 乐 | 1 | 3.8% |
| 安 | 1 | 3.8% |
| 电 | 1 | 3.8% |
| Other values (9) | 9 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1529899 | |
| None | 5711 | 0.4% |
| Cyrillic | 373 | < 0.1% |
| Hangul | 113 | < 0.1% |
| CJK | 26 | < 0.1% |
| Punctuation | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 144110 | 9.4% | |
| i | 130730 | 8.5% |
| e | 106540 | 7.0% |
| n | 101865 | 6.7% |
| a | 89039 | 5.8% |
| s | 86459 | 5.7% |
| o | 85292 | 5.6% |
| r | 83547 | 5.5% |
| t | 83433 | 5.5% |
| l | 63160 | 4.1% |
| Other values (77) | 555724 |
None
| Value | Count | Frequency (%) |
| é | 3176 | |
| ó | 416 | 7.3% |
| á | 317 | 5.6% |
| í | 173 | 3.0% |
| ü | 154 | 2.7% |
| ñ | 150 | 2.6% |
| ô | 140 | 2.5% |
| ä | 137 | 2.4% |
| è | 136 | 2.4% |
| ö | 132 | 2.3% |
| Other values (76) | 780 | 13.7% |
Cyrillic
| Value | Count | Frequency (%) |
| и | 34 | 9.1% |
| о | 28 | 7.5% |
| а | 26 | 7.0% |
| л | 22 | 5.9% |
| н | 20 | 5.4% |
| м | 19 | 5.1% |
| т | 17 | 4.6% |
| с | 16 | 4.3% |
| е | 16 | 4.3% |
| ь | 16 | 4.3% |
| Other values (36) | 159 |
Hangul
| Value | Count | Frequency (%) |
| 스 | 9 | 8.0% |
| 트 | 8 | 7.1% |
| 인 | 6 | 5.3% |
| 엔 | 5 | 4.4% |
| 주 | 5 | 4.4% |
| 터 | 5 | 4.4% |
| 먼 | 5 | 4.4% |
| 테 | 5 | 4.4% |
| 픽 | 4 | 3.5% |
| 로 | 3 | 2.7% |
| Other values (42) | 58 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 | |
| – | 2 | |
| | 1 | 14.3% |
| • | 1 | 14.3% |
CJK
| Value | Count | Frequency (%) |
| 北 | 2 | 7.7% |
| 京 | 2 | 7.7% |
| 司 | 2 | 7.7% |
| 公 | 2 | 7.7% |
| 限 | 2 | 7.7% |
| 有 | 2 | 7.7% |
| 影 | 2 | 7.7% |
| 乐 | 1 | 3.8% |
| 安 | 1 | 3.8% |
| 电 | 1 | 3.8% |
| Other values (9) | 9 |
ProductionCountries
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 2390 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| US | |
|---|---|
| Missing values | |
| GB | |
| FR | 1653 |
| JP | 1356 |
| Other values (2385) |
Length
| Max length | 98 |
|---|---|
| Median length | 2 |
| Mean length | 4.5812077 |
| Min length | 2 |
Characters and Unicode
| Total characters | 208335 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1764 ? |
|---|---|
| Unique (%) | 3.9% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
Common Values
| Value | Count | Frequency (%) |
| US | 17846 | |
| Missing values | 6214 | 13.7% |
| GB | 2235 | 4.9% |
| FR | 1653 | 3.6% |
| JP | 1356 | 3.0% |
| IT | 1029 | 2.3% |
| CA | 840 | 1.8% |
| DE | 749 | 1.6% |
| IN | 735 | 1.6% |
| RU | 734 | 1.6% |
| Other values (2380) | 12085 |
Length
| Value | Count | Frequency (%) |
| us | 21147 | |
| values | 6214 | 10.0% |
| missing | 6214 | 10.0% |
| gb | 4091 | 6.6% |
| fr | 3939 | 6.4% |
| de | 2254 | 3.6% |
| it | 2168 | 3.5% |
| ca | 1765 | 2.8% |
| jp | 1648 | 2.7% |
| es | 964 | 1.6% |
| Other values (154) | 11529 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 23041 | 11.1% |
| U | 23024 | 11.1% |
| s | 18739 | 9.0% |
| 16457 | 7.9% | |
| i | 12622 | 6.1% |
| , | 10243 | 4.9% |
| R | 6686 | 3.2% |
| M | 6660 | 3.2% |
| u | 6408 | 3.1% |
| n | 6408 | 3.1% |
| Other values (32) | 78047 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 105321 | |
| Lowercase Letter | 76314 | |
| Space Separator | 16457 | 7.9% |
| Other Punctuation | 10243 | 4.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 23041 | |
| U | 23024 | |
| R | 6686 | 6.3% |
| M | 6660 | 6.3% |
| B | 4982 | 4.7% |
| E | 4752 | 4.5% |
| G | 4448 | 4.2% |
| F | 4342 | 4.1% |
| I | 4010 | 3.8% |
| A | 3136 | 3.0% |
| Other values (16) | 20240 |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 18739 | |
| i | 12622 | |
| u | 6408 | 8.4% |
| n | 6408 | 8.4% |
| e | 6311 | 8.3% |
| a | 6214 | 8.1% |
| l | 6214 | 8.1% |
| v | 6214 | 8.1% |
| g | 6214 | 8.1% |
| o | 388 | 0.5% |
| Other values (4) | 582 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 16457 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10243 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 181635 | |
| Common | 26700 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 23041 | 12.7% |
| U | 23024 | 12.7% |
| s | 18739 | 10.3% |
| i | 12622 | 6.9% |
| R | 6686 | 3.7% |
| M | 6660 | 3.7% |
| u | 6408 | 3.5% |
| n | 6408 | 3.5% |
| e | 6311 | 3.5% |
| a | 6214 | 3.4% |
| Other values (30) | 65522 |
Common
| Value | Count | Frequency (%) |
| 16457 | ||
| , | 10243 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 208335 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 23041 | 11.1% |
| U | 23024 | 11.1% |
| s | 18739 | 9.0% |
| 16457 | 7.9% | |
| i | 12622 | 6.1% |
| , | 10243 | 4.9% |
| R | 6686 | 3.2% |
| M | 6660 | 3.2% |
| u | 6408 | 3.1% |
| n | 6408 | 3.1% |
| Other values (32) | 78047 |
ReleaseDate
Categorical
| Distinct | 17334 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| 2008-01-01 | 136 |
|---|---|
| 2009-01-01 | 121 |
| 2007-01-01 | 118 |
| 2005-01-01 | 111 |
| 2006-01-01 | 101 |
| Other values (17329) |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10.006597 |
| Min length | 10 |
Characters and Unicode
| Total characters | 455060 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8570 ? |
|---|---|
| Unique (%) | 18.8% |
Sample
| 1st row | 1995-10-30 |
|---|---|
| 2nd row | 1995-12-15 |
| 3rd row | 1995-12-22 |
| 4th row | 1995-12-22 |
| 5th row | 1995-02-10 |
Common Values
| Value | Count | Frequency (%) |
| 2008-01-01 | 136 | 0.3% |
| 2009-01-01 | 121 | 0.3% |
| 2007-01-01 | 118 | 0.3% |
| 2005-01-01 | 111 | 0.2% |
| 2006-01-01 | 101 | 0.2% |
| NoReleaseDate | 100 | 0.2% |
| 2002-01-01 | 96 | 0.2% |
| 2004-01-01 | 90 | 0.2% |
| 2001-01-01 | 84 | 0.2% |
| 2003-01-01 | 76 | 0.2% |
| Other values (17324) | 44443 |
Length
| Value | Count | Frequency (%) |
| 2008-01-01 | 136 | 0.3% |
| 2009-01-01 | 121 | 0.3% |
| 2007-01-01 | 118 | 0.3% |
| 2005-01-01 | 111 | 0.2% |
| 2006-01-01 | 101 | 0.2% |
| noreleasedate | 100 | 0.2% |
| 2002-01-01 | 96 | 0.2% |
| 2004-01-01 | 90 | 0.2% |
| 2001-01-01 | 84 | 0.2% |
| 2003-01-01 | 76 | 0.2% |
| Other values (17324) | 44443 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 97600 | |
| - | 90752 | |
| 1 | 84054 | |
| 2 | 52803 | |
| 9 | 39773 | |
| 3 | 15435 | 3.4% |
| 8 | 15279 | 3.4% |
| 6 | 15021 | 3.3% |
| 5 | 14836 | 3.3% |
| 7 | 14289 | 3.1% |
| Other values (10) | 15218 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 363008 | |
| Dash Punctuation | 90752 | 19.9% |
| Lowercase Letter | 1000 | 0.2% |
| Uppercase Letter | 300 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 97600 | |
| 1 | 84054 | |
| 2 | 52803 | |
| 9 | 39773 | |
| 3 | 15435 | 4.3% |
| 8 | 15279 | 4.2% |
| 6 | 15021 | 4.1% |
| 5 | 14836 | 4.1% |
| 7 | 14289 | 3.9% |
| 4 | 13918 | 3.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 400 | |
| a | 200 | |
| l | 100 | 10.0% |
| s | 100 | 10.0% |
| t | 100 | 10.0% |
| o | 100 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 100 | |
| R | 100 | |
| D | 100 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 90752 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 453760 | |
| Latin | 1300 | 0.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 97600 | |
| - | 90752 | |
| 1 | 84054 | |
| 2 | 52803 | |
| 9 | 39773 | |
| 3 | 15435 | 3.4% |
| 8 | 15279 | 3.4% |
| 6 | 15021 | 3.3% |
| 5 | 14836 | 3.3% |
| 7 | 14289 | 3.1% |
Latin
| Value | Count | Frequency (%) |
| e | 400 | |
| a | 200 | |
| N | 100 | 7.7% |
| R | 100 | 7.7% |
| l | 100 | 7.7% |
| s | 100 | 7.7% |
| D | 100 | 7.7% |
| t | 100 | 7.7% |
| o | 100 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 455060 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 97600 | |
| - | 90752 | |
| 1 | 84054 | |
| 2 | 52803 | |
| 9 | 39773 | |
| 3 | 15435 | 3.4% |
| 8 | 15279 | 3.4% |
| 6 | 15021 | 3.3% |
| 5 | 14836 | 3.3% |
| 7 | 14289 | 3.1% |
| Other values (10) | 15218 | 3.3% |
Revenue
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 6863 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 97 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11229357 |
| Minimum | 0 |
|---|---|
| Maximum | 2.7879651 × 109 |
| Zeros | 37972 |
| Zeros (%) | 83.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 48018459 |
| Maximum | 2.7879651 × 109 |
| Range | 2.7879651 × 109 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 64387893 |
|---|---|
| Coefficient of variation (CV) | 5.7338897 |
| Kurtosis | 237.09288 |
| Mean | 11229357 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.255124 |
| Sum | 5.0957698 × 1011 |
| Variance | 4.1458008 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37972 | |
| 12000000 | 20 | < 0.1% |
| 10000000 | 19 | < 0.1% |
| 11000000 | 19 | < 0.1% |
| 2000000 | 18 | < 0.1% |
| 6000000 | 17 | < 0.1% |
| 5000000 | 14 | < 0.1% |
| 500000 | 13 | < 0.1% |
| 8000000 | 13 | < 0.1% |
| 14000000 | 12 | < 0.1% |
| Other values (6853) | 7262 | 16.0% |
| (Missing) | 97 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 37972 | |
| 1 | 12 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2787965087 | 1 | |
| 2068223624 | 1 | |
| 1845034188 | 1 | |
| 1519557910 | 1 | |
| 1513528810 | 1 | |
| 1506249360 | 1 | |
| 1405403694 | 1 | |
| 1342000000 | 1 | |
| 1274219009 | 1 | |
| 1262886337 | 1 |
Runtime
Real number (ℝ)
| Distinct | 353 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 346 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.181675 |
| Minimum | 0 |
|---|---|
| Maximum | 1256 |
| Zeros | 1535 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 85 |
| median | 95 |
| Q3 | 107 |
| 95-th percentile | 138 |
| Maximum | 1256 |
| Range | 1256 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 38.341059 |
|---|---|
| Coefficient of variation (CV) | 0.4070968 |
| Kurtosis | 93.925543 |
| Mean | 94.181675 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 4.4907363 |
| Sum | 4250419 |
| Variance | 1470.0368 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 2549 | 5.6% |
| 0 | 1535 | 3.4% |
| 100 | 1470 | 3.2% |
| 95 | 1410 | 3.1% |
| 93 | 1214 | 2.7% |
| 96 | 1104 | 2.4% |
| 92 | 1079 | 2.4% |
| 94 | 1062 | 2.3% |
| 91 | 1055 | 2.3% |
| 88 | 1030 | 2.3% |
| Other values (343) | 31622 |
| Value | Count | Frequency (%) |
| 0 | 1535 | |
| 1 | 107 | 0.2% |
| 2 | 33 | 0.1% |
| 3 | 48 | 0.1% |
| 4 | 50 | 0.1% |
| 5 | 51 | 0.1% |
| 6 | 72 | 0.2% |
| 7 | 103 | 0.2% |
| 8 | 78 | 0.2% |
| 9 | 63 | 0.1% |
| Value | Count | Frequency (%) |
| 1256 | 1 | |
| 1140 | 2 | |
| 931 | 1 | |
| 925 | 1 | |
| 900 | 1 | |
| 877 | 1 | |
| 874 | 1 | |
| 840 | 2 | |
| 780 | 1 | |
| 720 | 1 |
Tagline
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 20269 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 25078 |
| Missing (%) | 55.1% |
| Memory size | 355.4 KiB |
| Based on a true story. | 7 |
|---|---|
| Trust no one. | 4 |
| Be careful what you wish for. | 4 |
| - | 4 |
| How far would you go? | 3 |
| Other values (20264) |
Length
| Max length | 297 |
|---|---|
| Median length | 204 |
| Mean length | 46.999314 |
| Min length | 1 |
Characters and Unicode
| Total characters | 958692 |
|---|---|
| Distinct characters | 170 |
| Distinct categories | 17 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 10 ? |
Unique
| Unique | 20163 ? |
|---|---|
| Unique (%) | 98.8% |
Sample
| 1st row | Roll the dice and unleash the excitement! |
|---|---|
| 2nd row | Still Yelling. Still Fighting. Still Ready for Love. |
| 3rd row | Friends are the people who let you be yourself... and never let you forget it. |
| 4th row | Just When His World Is Back To Normal... He's In For The Surprise Of His Life! |
| 5th row | A Los Angeles Crime Saga |
Common Values
| Value | Count | Frequency (%) |
| Based on a true story. | 7 | < 0.1% |
| Trust no one. | 4 | < 0.1% |
| Be careful what you wish for. | 4 | < 0.1% |
| - | 4 | < 0.1% |
| How far would you go? | 3 | < 0.1% |
| Drama | 3 | < 0.1% |
| Classic Albums | 3 | < 0.1% |
| There are two sides to every love story. | 3 | < 0.1% |
| There is no turning back | 3 | < 0.1% |
| Documentary | 3 | < 0.1% |
| Other values (20259) | 20361 | |
| (Missing) | 25078 |
Length
| Value | Count | Frequency (%) |
| the | 10998 | 6.3% |
| a | 6815 | 3.9% |
| of | 4404 | 2.5% |
| to | 3584 | 2.1% |
| is | 2796 | 1.6% |
| in | 2693 | 1.5% |
| and | 2682 | 1.5% |
| you | 2389 | 1.4% |
| 1582 | 0.9% | |
| for | 1523 | 0.9% |
| Other values (15100) | 134470 |
Most occurring characters
| Value | Count | Frequency (%) |
| 153686 | ||
| e | 94412 | 9.8% |
| t | 57267 | 6.0% |
| o | 56566 | 5.9% |
| a | 51473 | 5.4% |
| n | 47498 | 5.0% |
| i | 46036 | 4.8% |
| r | 44992 | 4.7% |
| s | 42360 | 4.4% |
| h | 37172 | 3.9% |
| Other values (160) | 327230 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 680479 | |
| Space Separator | 153686 | 16.0% |
| Uppercase Letter | 74991 | 7.8% |
| Other Punctuation | 44585 | 4.7% |
| Decimal Number | 2687 | 0.3% |
| Dash Punctuation | 1944 | 0.2% |
| Final Punctuation | 98 | < 0.1% |
| Open Punctuation | 56 | < 0.1% |
| Close Punctuation | 55 | < 0.1% |
| Currency Symbol | 37 | < 0.1% |
| Other values (7) | 74 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 94412 | |
| t | 57267 | 8.4% |
| o | 56566 | 8.3% |
| a | 51473 | 7.6% |
| n | 47498 | 7.0% |
| i | 46036 | 6.8% |
| r | 44992 | 6.6% |
| s | 42360 | 6.2% |
| h | 37172 | 5.5% |
| l | 30174 | 4.4% |
| Other values (43) | 172529 |
Other Letter
| Value | Count | Frequency (%) |
| வ | 1 | 2.9% |
| ன | 1 | 2.9% |
| 成 | 1 | 2.9% |
| 劇 | 1 | 2.9% |
| 熟 | 1 | 2.9% |
| த | 1 | 2.9% |
| ஆ | 1 | 2.9% |
| 時 | 1 | 2.9% |
| 舞 | 1 | 2.9% |
| 場 | 1 | 2.9% |
| Other values (24) | 24 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 10009 | 13.3% |
| A | 6874 | 9.2% |
| S | 5652 | 7.5% |
| H | 4402 | 5.9% |
| I | 4387 | 5.9% |
| E | 4306 | 5.7% |
| W | 3681 | 4.9% |
| O | 3477 | 4.6% |
| N | 3195 | 4.3% |
| L | 3194 | 4.3% |
| Other values (20) | 25814 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26647 | |
| ! | 5784 | 13.0% |
| ' | 5674 | 12.7% |
| , | 4226 | 9.5% |
| ? | 1161 | 2.6% |
| " | 582 | 1.3% |
| … | 148 | 0.3% |
| : | 138 | 0.3% |
| & | 83 | 0.2% |
| * | 42 | 0.1% |
| Other values (7) | 100 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 802 | |
| 1 | 516 | |
| 2 | 299 | 11.1% |
| 3 | 208 | 7.7% |
| 9 | 208 | 7.7% |
| 5 | 168 | 6.3% |
| 4 | 140 | 5.2% |
| 6 | 121 | 4.5% |
| 7 | 121 | 4.5% |
| 8 | 104 | 3.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 | |
| = | 5 | |
| | | 2 | 14.3% |
| ~ | 1 | 7.1% |
| − | 1 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1927 | |
| – | 9 | 0.5% |
| — | 8 | 0.4% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 82 | |
| ” | 15 | 15.3% |
| » | 1 | 1.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 14 | |
| ‘ | 4 | 21.1% |
| « | 1 | 5.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 49 | |
| [ | 7 | 12.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 48 | |
| ] | 7 | 12.7% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 | |
| ² | 1 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˌ | 1 | |
| ˈ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 153686 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 37 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ் | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 755470 | |
| Common | 203187 | 21.2% |
| Han | 21 | < 0.1% |
| Tamil | 5 | < 0.1% |
| Hiragana | 5 | < 0.1% |
| Katakana | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 94412 | 12.5% |
| t | 57267 | 7.6% |
| o | 56566 | 7.5% |
| a | 51473 | 6.8% |
| n | 47498 | 6.3% |
| i | 46036 | 6.1% |
| r | 44992 | 6.0% |
| s | 42360 | 5.6% |
| h | 37172 | 4.9% |
| l | 30174 | 4.0% |
| Other values (73) | 247520 |
Common
| Value | Count | Frequency (%) |
| 153686 | ||
| . | 26647 | 13.1% |
| ! | 5784 | 2.8% |
| ' | 5674 | 2.8% |
| , | 4226 | 2.1% |
| - | 1927 | 0.9% |
| ? | 1161 | 0.6% |
| 0 | 802 | 0.4% |
| " | 582 | 0.3% |
| 1 | 516 | 0.3% |
| Other values (42) | 2182 | 1.1% |
Han
| Value | Count | Frequency (%) |
| 成 | 1 | 4.8% |
| 劇 | 1 | 4.8% |
| 熟 | 1 | 4.8% |
| 時 | 1 | 4.8% |
| 舞 | 1 | 4.8% |
| 場 | 1 | 4.8% |
| 版 | 1 | 4.8% |
| 蜜 | 1 | 4.8% |
| 最 | 1 | 4.8% |
| 后 | 1 | 4.8% |
| Other values (11) | 11 |
Tamil
| Value | Count | Frequency (%) |
| வ | 1 | |
| ் | 1 | |
| ன | 1 | |
| த | 1 | |
| ஆ | 1 |
Hiragana
| Value | Count | Frequency (%) |
| は | 1 | |
| し | 1 | |
| て | 1 | |
| い | 1 | |
| る | 1 |
Katakana
| Value | Count | Frequency (%) |
| ク | 1 | |
| ラ | 1 | |
| ナ | 1 | |
| ド | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 958262 | |
| Punctuation | 280 | < 0.1% |
| None | 110 | < 0.1% |
| CJK | 21 | < 0.1% |
| Tamil | 5 | < 0.1% |
| Hiragana | 5 | < 0.1% |
| Katakana | 4 | < 0.1% |
| IPA Ext | 2 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
| Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 153686 | ||
| e | 94412 | 9.9% |
| t | 57267 | 6.0% |
| o | 56566 | 5.9% |
| a | 51473 | 5.4% |
| n | 47498 | 5.0% |
| i | 46036 | 4.8% |
| r | 44992 | 4.7% |
| s | 42360 | 4.4% |
| h | 37172 | 3.9% |
| Other values (78) | 326800 |
Punctuation
| Value | Count | Frequency (%) |
| … | 148 | |
| ’ | 82 | |
| ” | 15 | 5.4% |
| “ | 14 | 5.0% |
| – | 9 | 3.2% |
| — | 8 | 2.9% |
| ‘ | 4 | 1.4% |
None
| Value | Count | Frequency (%) |
| é | 18 | |
| ä | 16 | |
| ö | 8 | 7.3% |
| á | 6 | 5.5% |
| ó | 6 | 5.5% |
| ü | 5 | 4.5% |
| í | 5 | 4.5% |
| ı | 5 | 4.5% |
| · | 4 | 3.6% |
| ć | 3 | 2.7% |
| Other values (26) | 34 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 2 |
Tamil
| Value | Count | Frequency (%) |
| வ | 1 | |
| ் | 1 | |
| ன | 1 | |
| த | 1 | |
| ஆ | 1 |
CJK
| Value | Count | Frequency (%) |
| 成 | 1 | 4.8% |
| 劇 | 1 | 4.8% |
| 熟 | 1 | 4.8% |
| 時 | 1 | 4.8% |
| 舞 | 1 | 4.8% |
| 場 | 1 | 4.8% |
| 版 | 1 | 4.8% |
| 蜜 | 1 | 4.8% |
| 最 | 1 | 4.8% |
| 后 | 1 | 4.8% |
| Other values (11) | 11 |
Katakana
| Value | Count | Frequency (%) |
| ク | 1 | |
| ラ | 1 | |
| ナ | 1 | |
| ド | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˌ | 1 | |
| ˈ | 1 |
Hiragana
| Value | Count | Frequency (%) |
| は | 1 | |
| し | 1 | |
| て | 1 | |
| い | 1 | |
| る | 1 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
Title
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 42197 |
|---|---|
| Distinct (%) | 92.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| NoTitle | 100 |
|---|---|
| Cinderella | 11 |
| Alice in Wonderland | 9 |
| Hamlet | 9 |
| Les Misérables | 8 |
| Other values (42192) |
Length
| Max length | 105 |
|---|---|
| Median length | 79 |
| Mean length | 16.680447 |
| Min length | 1 |
Characters and Unicode
| Total characters | 758560 |
|---|---|
| Distinct characters | 287 |
| Distinct categories | 17 ? |
| Distinct scripts | 7 ? |
| Distinct blocks | 12 ? |
Unique
| Unique | 39869 ? |
|---|---|
| Unique (%) | 87.7% |
Sample
| 1st row | Toy Story |
|---|---|
| 2nd row | Jumanji |
| 3rd row | Grumpier Old Men |
| 4th row | Waiting to Exhale |
| 5th row | Father of the Bride Part II |
Common Values
| Value | Count | Frequency (%) |
| NoTitle | 100 | 0.2% |
| Cinderella | 11 | < 0.1% |
| Alice in Wonderland | 9 | < 0.1% |
| Hamlet | 9 | < 0.1% |
| Les Misérables | 8 | < 0.1% |
| Beauty and the Beast | 8 | < 0.1% |
| Treasure Island | 7 | < 0.1% |
| A Christmas Carol | 7 | < 0.1% |
| The Three Musketeers | 7 | < 0.1% |
| Blackout | 7 | < 0.1% |
| Other values (42187) | 45303 |
Length
| Value | Count | Frequency (%) |
| the | 14555 | 10.7% |
| of | 4930 | 3.6% |
| a | 2241 | 1.6% |
| in | 1693 | 1.2% |
| and | 1631 | 1.2% |
| to | 1054 | 0.8% |
| 757 | 0.6% | |
| man | 665 | 0.5% |
| love | 664 | 0.5% |
| for | 601 | 0.4% |
| Other values (24354) | 107490 |
Most occurring characters
| Value | Count | Frequency (%) |
| 90827 | 12.0% | |
| e | 76351 | 10.1% |
| a | 48940 | 6.5% |
| o | 45771 | 6.0% |
| n | 40817 | 5.4% |
| r | 40018 | 5.3% |
| i | 39864 | 5.3% |
| t | 36822 | 4.9% |
| s | 29519 | 3.9% |
| h | 28516 | 3.8% |
| Other values (277) | 281115 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 534634 | |
| Uppercase Letter | 117465 | 15.5% |
| Space Separator | 90827 | 12.0% |
| Other Punctuation | 10489 | 1.4% |
| Decimal Number | 3850 | 0.5% |
| Dash Punctuation | 981 | 0.1% |
| Close Punctuation | 87 | < 0.1% |
| Open Punctuation | 85 | < 0.1% |
| Final Punctuation | 38 | < 0.1% |
| Other Letter | 25 | < 0.1% |
| Other values (7) | 79 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 76351 | |
| a | 48940 | |
| o | 45771 | 8.6% |
| n | 40817 | 7.6% |
| r | 40018 | 7.5% |
| i | 39864 | 7.5% |
| t | 36822 | 6.9% |
| s | 29519 | 5.5% |
| h | 28516 | 5.3% |
| l | 26024 | 4.9% |
| Other values (121) | 121992 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 16119 | |
| S | 10336 | 8.8% |
| M | 8031 | 6.8% |
| B | 7659 | 6.5% |
| C | 7165 | 6.1% |
| A | 6785 | 5.8% |
| D | 6335 | 5.4% |
| L | 5872 | 5.0% |
| H | 5170 | 4.4% |
| W | 5166 | 4.4% |
| Other values (65) | 38827 |
Other Letter
| Value | Count | Frequency (%) |
| چ | 2 | 8.0% |
| ه | 2 | 8.0% |
| ی | 2 | 8.0% |
| ک | 2 | 8.0% |
| 傳 | 1 | 4.0% |
| 空 | 1 | 4.0% |
| 時 | 1 | 4.0% |
| 狗 | 1 | 4.0% |
| 貓 | 1 | 4.0% |
| ª | 1 | 4.0% |
| Other values (11) | 11 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3717 | |
| ' | 2505 | |
| . | 1603 | |
| , | 1134 | 10.8% |
| ! | 647 | 6.2% |
| & | 458 | 4.4% |
| ? | 269 | 2.6% |
| / | 79 | 0.8% |
| * | 19 | 0.2% |
| # | 13 | 0.1% |
| Other values (8) | 45 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 861 | |
| 1 | 697 | |
| 0 | 616 | |
| 3 | 482 | |
| 9 | 230 | 6.0% |
| 4 | 229 | 5.9% |
| 5 | 225 | 5.8% |
| 7 | 193 | 5.0% |
| 8 | 161 | 4.2% |
| 6 | 156 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 17 | |
| × | 3 | 12.5% |
| ∞ | 1 | 4.2% |
| = | 1 | 4.2% |
| → | 1 | 4.2% |
| − | 1 | 4.2% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 12 | |
| ² | 3 | 15.8% |
| ³ | 2 | 10.5% |
| ⅓ | 1 | 5.3% |
| ⁴ | 1 | 5.3% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 | |
| ☆ | 2 | |
| ™ | 1 | 12.5% |
| ♡ | 1 | 12.5% |
| № | 1 | 12.5% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 18 | |
| ¢ | 2 | 9.5% |
| £ | 1 | 4.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 966 | |
| – | 15 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 82 | |
| ] | 5 | 5.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 80 | |
| [ | 5 | 5.9% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 37 | |
| ” | 1 | 2.6% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 | |
| “ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 90827 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Format
| Value | Count | Frequency (%) |
| | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 651584 | |
| Common | 106436 | 14.0% |
| Cyrillic | 346 | < 0.1% |
| Greek | 170 | < 0.1% |
| Arabic | 11 | < 0.1% |
| Katakana | 8 | < 0.1% |
| Han | 5 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 76351 | 11.7% |
| a | 48940 | 7.5% |
| o | 45771 | 7.0% |
| n | 40817 | 6.3% |
| r | 40018 | 6.1% |
| i | 39864 | 6.1% |
| t | 36822 | 5.7% |
| s | 29519 | 4.5% |
| h | 28516 | 4.4% |
| l | 26024 | 4.0% |
| Other values (107) | 238942 |
Common
| Value | Count | Frequency (%) |
| 90827 | ||
| : | 3717 | 3.5% |
| ' | 2505 | 2.4% |
| . | 1603 | 1.5% |
| , | 1134 | 1.1% |
| - | 966 | 0.9% |
| 2 | 861 | 0.8% |
| 1 | 697 | 0.7% |
| ! | 647 | 0.6% |
| 0 | 616 | 0.6% |
| Other values (50) | 2863 | 2.7% |
Cyrillic
| Value | Count | Frequency (%) |
| е | 32 | 9.2% |
| о | 32 | 9.2% |
| а | 29 | 8.4% |
| н | 24 | 6.9% |
| и | 23 | 6.6% |
| р | 22 | 6.4% |
| к | 17 | 4.9% |
| с | 15 | 4.3% |
| в | 14 | 4.0% |
| л | 14 | 4.0% |
| Other values (38) | 124 |
Greek
| Value | Count | Frequency (%) |
| α | 20 | 11.8% |
| ι | 14 | 8.2% |
| ο | 14 | 8.2% |
| τ | 9 | 5.3% |
| λ | 8 | 4.7% |
| ά | 8 | 4.7% |
| ρ | 8 | 4.7% |
| ν | 7 | 4.1% |
| π | 6 | 3.5% |
| η | 6 | 3.5% |
| Other values (32) | 70 |
Katakana
| Value | Count | Frequency (%) |
| テ | 1 | |
| ポ | 1 | |
| ィ | 1 | |
| ス | 1 | |
| タ | 1 | |
| ン | 1 | |
| ァ | 1 | |
| フ | 1 |
Arabic
| Value | Count | Frequency (%) |
| چ | 2 | |
| ه | 2 | |
| ی | 2 | |
| ک | 2 | |
| س | 1 | |
| ا | 1 | |
| ج | 1 |
Han
| Value | Count | Frequency (%) |
| 傳 | 1 | |
| 空 | 1 | |
| 時 | 1 | |
| 狗 | 1 | |
| 貓 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 756995 | |
| None | 1124 | 0.1% |
| Cyrillic | 346 | < 0.1% |
| Punctuation | 62 | < 0.1% |
| Arabic | 11 | < 0.1% |
| Katakana | 8 | < 0.1% |
| CJK | 5 | < 0.1% |
| Misc Symbols | 3 | < 0.1% |
| Letterlike Symbols | 2 | < 0.1% |
| Math Operators | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 90827 | 12.0% | |
| e | 76351 | 10.1% |
| a | 48940 | 6.5% |
| o | 45771 | 6.0% |
| n | 40817 | 5.4% |
| r | 40018 | 5.3% |
| i | 39864 | 5.3% |
| t | 36822 | 4.9% |
| s | 29519 | 3.9% |
| h | 28516 | 3.8% |
| Other values (76) | 279550 |
None
| Value | Count | Frequency (%) |
| é | 218 | |
| ä | 127 | 11.3% |
| ö | 55 | 4.9% |
| è | 53 | 4.7% |
| ô | 44 | 3.9% |
| ü | 39 | 3.5% |
| ó | 37 | 3.3% |
| á | 35 | 3.1% |
| ı | 35 | 3.1% |
| í | 33 | 2.9% |
| Other values (108) | 448 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 37 | |
| – | 15 | |
| … | 5 | 8.1% |
| | 2 | 3.2% |
| ‘ | 1 | 1.6% |
| ” | 1 | 1.6% |
| “ | 1 | 1.6% |
Cyrillic
| Value | Count | Frequency (%) |
| е | 32 | 9.2% |
| о | 32 | 9.2% |
| а | 29 | 8.4% |
| н | 24 | 6.9% |
| и | 23 | 6.6% |
| р | 22 | 6.4% |
| к | 17 | 4.9% |
| с | 15 | 4.3% |
| в | 14 | 4.0% |
| л | 14 | 4.0% |
| Other values (38) | 124 |
Arabic
| Value | Count | Frequency (%) |
| چ | 2 | |
| ه | 2 | |
| ی | 2 | |
| ک | 2 | |
| س | 1 | |
| ا | 1 | |
| ج | 1 |
Misc Symbols
| Value | Count | Frequency (%) |
| ☆ | 2 | |
| ♡ | 1 |
CJK
| Value | Count | Frequency (%) |
| 傳 | 1 | |
| 空 | 1 | |
| 時 | 1 | |
| 狗 | 1 | |
| 貓 | 1 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 1 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 1 | |
| № | 1 |
Math Operators
| Value | Count | Frequency (%) |
| ∞ | 1 | |
| − | 1 |
Katakana
| Value | Count | Frequency (%) |
| テ | 1 | |
| ポ | 1 | |
| ィ | 1 | |
| ス | 1 | |
| タ | 1 | |
| ン | 1 | |
| ァ | 1 | |
| フ | 1 |
Arrows
| Value | Count | Frequency (%) |
| → | 1 |
VoteAverage
Real number (ℝ)
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 100 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.62407 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 2947 |
| Zeros (%) | 6.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6.8 |
| 95-th percentile | 7.8 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.9154225 |
|---|---|
| Coefficient of variation (CV) | 0.34057587 |
| Kurtosis | 2.5420547 |
| Mean | 5.62407 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | -1.524472 |
| Sum | 255197.8 |
| Variance | 3.6688434 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2947 | 6.5% |
| 6 | 2462 | 5.4% |
| 5 | 1998 | 4.4% |
| 7 | 1883 | 4.1% |
| 6.5 | 1722 | 3.8% |
| 6.3 | 1603 | 3.5% |
| 5.5 | 1381 | 3.0% |
| 5.8 | 1369 | 3.0% |
| 6.4 | 1350 | 3.0% |
| 6.7 | 1342 | 3.0% |
| Other values (82) | 27319 |
| Value | Count | Frequency (%) |
| 0 | 2947 | |
| 0.5 | 13 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 1 | 103 | 0.2% |
| 1.1 | 1 | < 0.1% |
| 1.2 | 4 | < 0.1% |
| 1.3 | 13 | < 0.1% |
| 1.4 | 5 | < 0.1% |
| 1.5 | 30 | 0.1% |
| 1.6 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 185 | |
| 9.8 | 1 | < 0.1% |
| 9.6 | 1 | < 0.1% |
| 9.5 | 18 | < 0.1% |
| 9.4 | 3 | < 0.1% |
| 9.3 | 18 | < 0.1% |
| 9.2 | 4 | < 0.1% |
| 9.1 | 2 | < 0.1% |
| 9 | 158 | |
| 8.9 | 7 | < 0.1% |
VoteCount
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1820 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 100 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.09644 |
| Minimum | 0 |
|---|---|
| Maximum | 14075 |
| Zeros | 2849 |
| Zeros (%) | 6.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 10 |
| Q3 | 34 |
| 95-th percentile | 434 |
| Maximum | 14075 |
| Range | 14075 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 491.74289 |
|---|---|
| Coefficient of variation (CV) | 4.4664741 |
| Kurtosis | 150.92858 |
| Mean | 110.09644 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 10.440782 |
| Sum | 4995736 |
| Variance | 241811.07 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3242 | 7.1% |
| 2 | 3127 | 6.9% |
| 0 | 2849 | 6.3% |
| 3 | 2785 | 6.1% |
| 4 | 2478 | 5.4% |
| 5 | 2097 | 4.6% |
| 6 | 1747 | 3.8% |
| 7 | 1570 | 3.5% |
| 8 | 1359 | 3.0% |
| 9 | 1194 | 2.6% |
| Other values (1810) | 22928 |
| Value | Count | Frequency (%) |
| 0 | 2849 | |
| 1 | 3242 | |
| 2 | 3127 | |
| 3 | 2785 | |
| 4 | 2478 | |
| 5 | 2097 | |
| 6 | 1747 | |
| 7 | 1570 | |
| 8 | 1359 | |
| 9 | 1194 | 2.6% |
| Value | Count | Frequency (%) |
| 14075 | 1 | |
| 12269 | 1 | |
| 12114 | 1 | |
| 12000 | 1 | |
| 11444 | 1 | |
| 11187 | 1 | |
| 10297 | 1 | |
| 10014 | 1 | |
| 9678 | 1 | |
| 9634 | 1 |
ReleaseYear
Real number (ℝ)
| Distinct | 135 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 100 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1991.8812 |
| Minimum | 1874 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 1874 |
|---|---|
| 5-th percentile | 1941 |
| Q1 | 1978 |
| median | 2001 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2020 |
| Range | 146 |
| Interquartile range (IQR) | 32 |
Descriptive statistics
| Standard deviation | 24.05536 |
|---|---|
| Coefficient of variation (CV) | 0.012076704 |
| Kurtosis | 0.84010576 |
| Mean | 1991.8812 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -1.2248636 |
| Sum | 90383601 |
| Variance | 578.66033 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 1974 | 4.3% |
| 2015 | 1905 | 4.2% |
| 2013 | 1889 | 4.2% |
| 2012 | 1722 | 3.8% |
| 2011 | 1667 | 3.7% |
| 2016 | 1604 | 3.5% |
| 2009 | 1586 | 3.5% |
| 2010 | 1501 | 3.3% |
| 2008 | 1473 | 3.2% |
| 2007 | 1320 | 2.9% |
| Other values (125) | 28735 |
| Value | Count | Frequency (%) |
| 1874 | 1 | < 0.1% |
| 1878 | 1 | < 0.1% |
| 1883 | 1 | < 0.1% |
| 1887 | 1 | < 0.1% |
| 1888 | 2 | < 0.1% |
| 1890 | 5 | < 0.1% |
| 1891 | 6 | |
| 1892 | 3 | < 0.1% |
| 1893 | 1 | < 0.1% |
| 1894 | 13 |
| Value | Count | Frequency (%) |
| 2020 | 1 | < 0.1% |
| 2018 | 5 | < 0.1% |
| 2017 | 532 | 1.2% |
| 2016 | 1604 | |
| 2015 | 1905 | |
| 2014 | 1974 | |
| 2013 | 1889 | |
| 2012 | 1722 | |
| 2011 | 1667 | |
| 2010 | 1501 |
ReleaseMonth
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 100 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4590753 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.6281605 |
|---|---|
| Coefficient of variation (CV) | 0.56171515 |
| Kurtosis | -1.3247729 |
| Mean | 6.4590753 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.071880633 |
| Sum | 293087 |
| Variance | 13.163548 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5912 | |
| 9 | 4838 | |
| 10 | 4615 | |
| 12 | 3786 | |
| 11 | 3661 | |
| 3 | 3553 | |
| 4 | 3453 | |
| 8 | 3394 | |
| 5 | 3339 | |
| 6 | 3153 | |
| Other values (2) | 5672 |
| Value | Count | Frequency (%) |
| 1 | 5912 | |
| 2 | 3032 | |
| 3 | 3553 | |
| 4 | 3453 | |
| 5 | 3339 | |
| 6 | 3153 | |
| 7 | 2640 | |
| 8 | 3394 | |
| 9 | 4838 | |
| 10 | 4615 |
| Value | Count | Frequency (%) |
| 12 | 3786 | |
| 11 | 3661 | |
| 10 | 4615 | |
| 9 | 4838 | |
| 8 | 3394 | |
| 7 | 2640 | |
| 6 | 3153 | |
| 5 | 3339 | |
| 4 | 3453 | |
| 3 | 3553 |
Return
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 5232 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 97 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 659.99915 |
| Minimum | 0 |
|---|---|
| Maximum | 12396383 |
| Zeros | 39998 |
| Zeros (%) | 88.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2.5353413 |
| Maximum | 12396383 |
| Range | 12396383 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 74690.825 |
|---|---|
| Coefficient of variation (CV) | 113.16806 |
| Kurtosis | 20674.324 |
| Mean | 659.99915 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 138.3341 |
| Sum | 29950101 |
| Variance | 5.5787194 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 39998 | |
| 1 | 20 | < 0.1% |
| 2 | 12 | < 0.1% |
| 4 | 11 | < 0.1% |
| 5 | 8 | < 0.1% |
| 2.5 | 7 | < 0.1% |
| 3 | 7 | < 0.1% |
| 1.333333333 | 7 | < 0.1% |
| 1.5 | 6 | < 0.1% |
| 0.25 | 4 | < 0.1% |
| Other values (5222) | 5299 | 11.7% |
| (Missing) | 97 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 39998 | |
| 5.217391304 × 10-7 | 1 | < 0.1% |
| 7.5 × 10-7 | 1 | < 0.1% |
| 9.375 × 10-7 | 1 | < 0.1% |
| 1.499133126 × 10-6 | 1 | < 0.1% |
| 1.8 × 10-6 | 1 | < 0.1% |
| 1.916666667 × 10-6 | 1 | < 0.1% |
| 3.5 × 10-6 | 1 | < 0.1% |
| 4 × 10-6 | 1 | < 0.1% |
| 5.111111111 × 10-6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 12396383 | 1 | |
| 8500000 | 1 | |
| 4197476.625 | 1 | |
| 2755584 | 1 | |
| 1018619.283 | 1 | |
| 1000000 | 1 | |
| 26881.72043 | 1 | |
| 12890.38667 | 1 | |
| 5330.33945 | 1 | |
| 4133.333333 | 1 |
Director
Categorical
| Distinct | 17573 |
|---|---|
| Distinct (%) | 38.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| [] | 887 |
|---|---|
| ['John Ford'] | 66 |
| ['Michael Curtiz'] | 65 |
| ['Werner Herzog'] | 54 |
| ['Alfred Hitchcock'] | 53 |
| Other values (17568) |
Length
| Max length | 37 |
|---|---|
| Median length | 33 |
| Mean length | 17.164153 |
| Min length | 2 |
Characters and Unicode
| Total characters | 780557 |
|---|---|
| Distinct characters | 203 |
| Distinct categories | 9 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 7 ? |
Unique
| Unique | 10622 ? |
|---|---|
| Unique (%) | 23.4% |
Sample
| 1st row | ['John Lasseter'] |
|---|---|
| 2nd row | ['Joe Johnston'] |
| 3rd row | ['Howard Deutch'] |
| 4th row | ['Forest Whitaker'] |
| 5th row | ['Charles Shyer'] |
Common Values
| Value | Count | Frequency (%) |
| [] | 887 | 2.0% |
| ['John Ford'] | 66 | 0.1% |
| ['Michael Curtiz'] | 65 | 0.1% |
| ['Werner Herzog'] | 54 | 0.1% |
| ['Alfred Hitchcock'] | 53 | 0.1% |
| ['Georges Méliès'] | 51 | 0.1% |
| ['Woody Allen'] | 49 | 0.1% |
| ['Jean-Luc Godard'] | 47 | 0.1% |
| ['Sidney Lumet'] | 46 | 0.1% |
| ['Charlie Chaplin'] | 44 | 0.1% |
| Other values (17563) | 44114 |
Length
| Value | Count | Frequency (%) |
| john | 1165 | 1.2% |
| 974 | 1.0% | |
| michael | 879 | 0.9% |
| robert | 806 | 0.9% |
| david | 806 | 0.9% |
| peter | 525 | 0.6% |
| william | 513 | 0.5% |
| richard | 511 | 0.5% |
| james | 489 | 0.5% |
| paul | 439 | 0.5% |
| Other values (17101) | 87607 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 89011 | 11.4% |
| e | 52337 | 6.7% |
| a | 51706 | 6.6% |
| 49250 | 6.3% | |
| [ | 45476 | 5.8% |
| ] | 45476 | 5.8% |
| r | 40797 | 5.2% |
| n | 40267 | 5.2% |
| i | 39006 | 5.0% |
| o | 35375 | 4.5% |
| Other values (193) | 291856 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 451377 | |
| Uppercase Letter | 95412 | 12.2% |
| Other Punctuation | 92297 | 11.8% |
| Space Separator | 49250 | 6.3% |
| Open Punctuation | 45478 | 5.8% |
| Close Punctuation | 45478 | 5.8% |
| Dash Punctuation | 1238 | 0.2% |
| Other Letter | 21 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 52337 | |
| a | 51706 | |
| r | 40797 | 9.0% |
| n | 40267 | 8.9% |
| i | 39006 | 8.6% |
| o | 35375 | 7.8% |
| l | 27477 | 6.1% |
| s | 20792 | 4.6% |
| t | 19775 | 4.4% |
| h | 16708 | 3.7% |
| Other values (97) | 107137 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 8356 | 8.8% |
| S | 7933 | 8.3% |
| J | 7211 | 7.6% |
| R | 6167 | 6.5% |
| B | 5973 | 6.3% |
| C | 5961 | 6.2% |
| A | 5717 | 6.0% |
| D | 5104 | 5.3% |
| L | 4950 | 5.2% |
| G | 4566 | 4.8% |
| Other values (52) | 33474 |
Other Letter
| Value | Count | Frequency (%) |
| ی | 2 | 9.5% |
| ا | 2 | 9.5% |
| م | 2 | 9.5% |
| ع | 1 | 4.8% |
| 塩 | 1 | 4.8% |
| 谷 | 1 | 4.8% |
| 直 | 1 | 4.8% |
| 義 | 1 | 4.8% |
| ن | 1 | 4.8% |
| پ | 1 | 4.8% |
| Other values (8) | 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 89011 | |
| . | 2885 | 3.1% |
| " | 374 | 0.4% |
| , | 14 | < 0.1% |
| \ | 12 | < 0.1% |
| · | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 9 | 1 | 16.7% |
| 5 | 1 | 16.7% |
| 3 | 1 | 16.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 45476 | |
| ( | 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 45476 | |
| ) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 49250 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1238 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 546645 | |
| Common | 233747 | |
| Cyrillic | 144 | < 0.1% |
| Arabic | 10 | < 0.1% |
| Han | 8 | < 0.1% |
| Hangul | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 52337 | 9.6% |
| a | 51706 | 9.5% |
| r | 40797 | 7.5% |
| n | 40267 | 7.4% |
| i | 39006 | 7.1% |
| o | 35375 | 6.5% |
| l | 27477 | 5.0% |
| s | 20792 | 3.8% |
| t | 19775 | 3.6% |
| h | 16708 | 3.1% |
| Other values (123) | 202405 |
Cyrillic
| Value | Count | Frequency (%) |
| и | 19 | |
| о | 11 | 7.6% |
| е | 11 | 7.6% |
| л | 11 | 7.6% |
| р | 10 | 6.9% |
| а | 10 | 6.9% |
| к | 8 | 5.6% |
| н | 7 | 4.9% |
| в | 6 | 4.2% |
| д | 6 | 4.2% |
| Other values (26) | 45 |
Common
| Value | Count | Frequency (%) |
| ' | 89011 | |
| 49250 | ||
| [ | 45476 | |
| ] | 45476 | |
| . | 2885 | 1.2% |
| - | 1238 | 0.5% |
| " | 374 | 0.2% |
| , | 14 | < 0.1% |
| \ | 12 | < 0.1% |
| 0 | 3 | < 0.1% |
| Other values (6) | 8 | < 0.1% |
Han
| Value | Count | Frequency (%) |
| 塩 | 1 | |
| 谷 | 1 | |
| 直 | 1 | |
| 義 | 1 | |
| 玛 | 1 | |
| 森 | 1 | |
| 杰 | 1 | |
| 莫 | 1 |
Arabic
| Value | Count | Frequency (%) |
| ی | 2 | |
| ا | 2 | |
| م | 2 | |
| ع | 1 | |
| ن | 1 | |
| پ | 1 | |
| د | 1 |
Hangul
| Value | Count | Frequency (%) |
| 영 | 1 | |
| 진 | 1 | |
| 모 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 776503 | |
| None | 3886 | 0.5% |
| Cyrillic | 144 | < 0.1% |
| Arabic | 10 | < 0.1% |
| CJK | 8 | < 0.1% |
| Latin Ext Additional | 3 | < 0.1% |
| Hangul | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 89011 | 11.5% |
| e | 52337 | 6.7% |
| a | 51706 | 6.7% |
| 49250 | 6.3% | |
| [ | 45476 | 5.9% |
| ] | 45476 | 5.9% |
| r | 40797 | 5.3% |
| n | 40267 | 5.2% |
| i | 39006 | 5.0% |
| o | 35375 | 4.6% |
| Other values (57) | 287802 |
None
| Value | Count | Frequency (%) |
| é | 916 | |
| á | 379 | 9.8% |
| ö | 255 | 6.6% |
| ó | 229 | 5.9% |
| í | 228 | 5.9% |
| ô | 153 | 3.9% |
| ä | 149 | 3.8% |
| è | 134 | 3.4% |
| ü | 108 | 2.8% |
| ç | 106 | 2.7% |
| Other values (69) | 1229 |
Cyrillic
| Value | Count | Frequency (%) |
| и | 19 | |
| о | 11 | 7.6% |
| е | 11 | 7.6% |
| л | 11 | 7.6% |
| р | 10 | 6.9% |
| а | 10 | 6.9% |
| к | 8 | 5.6% |
| н | 7 | 4.9% |
| в | 6 | 4.2% |
| д | 6 | 4.2% |
| Other values (26) | 45 |
Arabic
| Value | Count | Frequency (%) |
| ی | 2 | |
| ا | 2 | |
| م | 2 | |
| ع | 1 | |
| ن | 1 | |
| پ | 1 | |
| د | 1 |
CJK
| Value | Count | Frequency (%) |
| 塩 | 1 | |
| 谷 | 1 | |
| 直 | 1 | |
| 義 | 1 | |
| 玛 | 1 | |
| 森 | 1 | |
| 杰 | 1 | |
| 莫 | 1 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 1 | |
| ễ | 1 | |
| ấ | 1 |
Hangul
| Value | Count | Frequency (%) |
| 영 | 1 | |
| 진 | 1 | |
| 모 | 1 |
Id
Real number (ℝ)
| Distinct | 45432 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108346 |
| Minimum | 2 |
|---|---|
| Maximum | 469172 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 5419 |
| Q1 | 26443.25 |
| median | 60002.5 |
| Q3 | 157302 |
| 95-th percentile | 358552.75 |
| Maximum | 469172 |
| Range | 469170 |
| Interquartile range (IQR) | 130858.75 |
Descriptive statistics
| Standard deviation | 112443.8 |
|---|---|
| Coefficient of variation (CV) | 1.0378214 |
| Kurtosis | 0.54902668 |
| Mean | 108346 |
| Median Absolute Deviation (MAD) | 44528 |
| Skewness | 1.2798523 |
| Sum | 4.9271426 × 109 |
| Variance | 1.2643607 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 141971 | 3 | < 0.1% |
| 298721 | 2 | < 0.1% |
| 9755 | 2 | < 0.1% |
| 10991 | 2 | < 0.1% |
| 99080 | 2 | < 0.1% |
| 152795 | 2 | < 0.1% |
| 22649 | 2 | < 0.1% |
| 18440 | 2 | < 0.1% |
| 5511 | 2 | < 0.1% |
| 132641 | 2 | < 0.1% |
| Other values (45422) | 45455 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 | |
| 15 | 1 | |
| 16 | 1 |
| Value | Count | Frequency (%) |
| 469172 | 1 | |
| 468707 | 1 | |
| 468343 | 1 | |
| 467731 | 1 | |
| 465044 | 1 | |
| 464819 | 1 | |
| 464207 | 1 | |
| 464111 | 1 | |
| 463906 | 1 | |
| 463800 | 1 |
MovieCharacter
Categorical
| Distinct | 40180 |
|---|---|
| Distinct (%) | 88.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| NoCharacter | 2570 |
|---|---|
| Himself | 516 |
| , , , | 211 |
| , , , , | 209 |
| , , | 141 |
| Other values (40175) |
Length
| Max length | 6647 |
|---|---|
| Median length | 1773 |
| Mean length | 168.8468 |
| Min length | 2 |
Characters and Unicode
| Total characters | 7678477 |
|---|---|
| Distinct characters | 618 |
| Distinct categories | 20 ? |
| Distinct scripts | 12 ? |
| Distinct blocks | 14 ? |
Unique
| Unique | 39945 ? |
|---|---|
| Unique (%) | 87.8% |
Sample
| 1st row | Woody (voice), Buzz Lightyear (voice), Mr. Potato Head (voice), Slinky Dog (voice), Rex (voice), Hamm (voice), Bo Peep (voice), Andy (voice), Sid (voice), Mrs. Davis (voice), Sergeant (voice), Hannah (voice), TV Announcer (voice) |
|---|---|
| 2nd row | Alan Parrish, Samuel Alan Parrish / Van Pelt, Judy Sheperd, Peter Shepherd, Sarah Whittle, Nora Shepherd, Carl Bentley, Carol Anne Parrish, Alan Parrish (young), Sarah Whittle (young), Exterminator, Mrs. Thomas the Realtor, Benjamin, Caleb, Billy Jessup, Cop, Bum, Jim Shepherd, Martha Shepherd, Gun Salesman, Paramedic, Paramedic, Girl, Girl, Baker, Pianist |
| 3rd row | Max Goldman, John Gustafson, Ariel Gustafson, Maria Sophia Coletta Ragetti, Melanie Gustafson, Grandpa Gustafson, Jacob Goldman |
| 4th row | Savannah 'Vannah' Jackson, Bernadine 'Bernie' Harris, Gloria 'Glo' Matthews, Robin Stokes, Marvin King, Kenneth Dawkins, John Harris, Sr., Troy, Joseph, James Wheeler |
| 5th row | George Banks, Nina Banks, Franck Eggelhoffer, Annie Banks-MacKenzie, Bryan MacKenzie, Matty Banks, Howard Weinstein, John MacKenzie, Joanna MacKenzie, Dr. Megan Eisenberg, Mr. Habib, Wife Mrs. Habib |
Common Values
| Value | Count | Frequency (%) |
| NoCharacter | 2570 | 5.7% |
| Himself | 516 | 1.1% |
| , , , | 211 | 0.5% |
| , , , , | 209 | 0.5% |
| , , | 141 | 0.3% |
| , , , , , | 129 | 0.3% |
| Narrator | 124 | 0.3% |
| , | 115 | 0.3% |
| , , , , , , | 107 | 0.2% |
| , , , , , , , | 85 | 0.2% |
| Other values (40170) | 41269 |
Length
| Value | Count | Frequency (%) |
| 37208 | 3.5% | |
| uncredited | 19404 | 1.8% |
| himself | 14232 | 1.3% |
| voice | 13783 | 1.3% |
| the | 10319 | 1.0% |
| dr | 6831 | 0.6% |
| mrs | 5580 | 0.5% |
| man | 5252 | 0.5% |
| mr | 5177 | 0.5% |
| girl | 4420 | 0.4% |
| Other values (130038) | 947609 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1028323 | 13.4% | |
| e | 649946 | 8.5% |
| , | 525726 | 6.8% |
| a | 519515 | 6.8% |
| r | 472136 | 6.1% |
| i | 419713 | 5.5% |
| n | 404639 | 5.3% |
| o | 360973 | 4.7% |
| t | 288080 | 3.8% |
| l | 273828 | 3.6% |
| Other values (608) | 2735598 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4977433 | |
| Space Separator | 1028323 | 13.4% |
| Uppercase Letter | 950790 | 12.4% |
| Other Punctuation | 608315 | 7.9% |
| Open Punctuation | 42194 | 0.5% |
| Close Punctuation | 42154 | 0.5% |
| Decimal Number | 14368 | 0.2% |
| Dash Punctuation | 13905 | 0.2% |
| Other Letter | 632 | < 0.1% |
| Final Punctuation | 141 | < 0.1% |
| Other values (10) | 222 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| º | 25 | 4.0% |
| ا | 25 | 4.0% |
| ل | 17 | 2.7% |
| ي | 14 | 2.2% |
| ب | 12 | 1.9% |
| د | 9 | 1.4% |
| ל | 9 | 1.4% |
| ر | 9 | 1.4% |
| و | 8 | 1.3% |
| س | 8 | 1.3% |
| Other values (274) | 496 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 649946 | |
| a | 519515 | |
| r | 472136 | |
| i | 419713 | 8.4% |
| n | 404639 | 8.1% |
| o | 360973 | 7.3% |
| t | 288080 | 5.8% |
| l | 273828 | 5.5% |
| s | 242202 | 4.9% |
| d | 174401 | 3.5% |
| Other values (150) | 1172000 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 93697 | 9.9% |
| S | 80372 | 8.5% |
| C | 75971 | 8.0% |
| B | 61447 | 6.5% |
| D | 57096 | 6.0% |
| H | 55392 | 5.8% |
| P | 52540 | 5.5% |
| A | 50399 | 5.3% |
| G | 44764 | 4.7% |
| L | 44614 | 4.7% |
| Other values (95) | 334498 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 525726 | |
| . | 34275 | 5.6% |
| ' | 26738 | 4.4% |
| / | 10433 | 1.7% |
| # | 6037 | 1.0% |
| " | 4257 | 0.7% |
| : | 445 | 0.1% |
| & | 268 | < 0.1% |
| ! | 45 | < 0.1% |
| ? | 31 | < 0.1% |
| Other values (6) | 60 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5053 | |
| 2 | 4015 | |
| 3 | 1445 | 10.1% |
| 4 | 748 | 5.2% |
| 0 | 647 | 4.5% |
| 9 | 568 | 4.0% |
| 5 | 514 | 3.6% |
| 8 | 490 | 3.4% |
| 6 | 458 | 3.2% |
| 7 | 430 | 3.0% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 3 | |
| ̂ | 3 | |
| ่ | 2 | |
| ̀ | 1 | 7.1% |
| ּ | 1 | 7.1% |
| ี | 1 | 7.1% |
| ิ | 1 | 7.1% |
| ื | 1 | 7.1% |
| ๋ | 1 | 7.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 42047 | |
| [ | 121 | 0.3% |
| „ | 23 | 0.1% |
| ‚ | 3 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 42032 | |
| ] | 121 | 0.3% |
| ) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13876 | |
| – | 28 | 0.2% |
| — | 1 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 83 | |
| » | 47 | |
| ” | 11 | 7.8% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 47 | |
| “ | 33 | |
| ‘ | 7 | 8.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 24 | |
| № | 1 | 3.8% |
| ® | 1 | 3.8% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 6 | |
| + | 5 | |
| < | 1 | 8.3% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 28 | |
| ´ | 19 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 13 | |
| ¢ | 2 | 13.3% |
Control
| Value | Count | Frequency (%) |
| 8 | ||
| | 1 | 11.1% |
Format
| Value | Count | Frequency (%) |
| | 2 | |
| | 1 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 | |
| ² | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1028323 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5913946 | |
| Common | 1749608 | 22.8% |
| Cyrillic | 14096 | 0.2% |
| Hangul | 223 | < 0.1% |
| Greek | 212 | < 0.1% |
| Arabic | 156 | < 0.1% |
| Han | 117 | < 0.1% |
| Hebrew | 60 | < 0.1% |
| Thai | 26 | < 0.1% |
| Katakana | 23 | < 0.1% |
| Other values (2) | 10 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 649946 | 11.0% |
| a | 519515 | 8.8% |
| r | 472136 | 8.0% |
| i | 419713 | 7.1% |
| n | 404639 | 6.8% |
| o | 360973 | 6.1% |
| t | 288080 | 4.9% |
| l | 273828 | 4.6% |
| s | 242202 | 4.1% |
| d | 174401 | 2.9% |
| Other values (150) | 2108513 |
Hangul
| Value | Count | Frequency (%) |
| 진 | 7 | 3.1% |
| 영 | 6 | 2.7% |
| 최 | 6 | 2.7% |
| 동 | 5 | 2.2% |
| 유 | 5 | 2.2% |
| 이 | 5 | 2.2% |
| 은 | 4 | 1.8% |
| 정 | 4 | 1.8% |
| 희 | 4 | 1.8% |
| 사 | 4 | 1.8% |
| Other values (113) | 173 |
Han
| Value | Count | Frequency (%) |
| 大 | 5 | 4.3% |
| 爸 | 4 | 3.4% |
| 雄 | 4 | 3.4% |
| 子 | 3 | 2.6% |
| 蕭 | 2 | 1.7% |
| 智 | 2 | 1.7% |
| 心 | 2 | 1.7% |
| 柏 | 2 | 1.7% |
| 毒 | 2 | 1.7% |
| 相 | 2 | 1.7% |
| Other values (77) | 89 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 1497 | 10.6% |
| о | 1125 | 8.0% |
| и | 1040 | 7.4% |
| е | 968 | 6.9% |
| н | 924 | 6.6% |
| р | 909 | 6.4% |
| т | 631 | 4.5% |
| к | 613 | 4.3% |
| л | 600 | 4.3% |
| в | 547 | 3.9% |
| Other values (55) | 5242 |
Common
| Value | Count | Frequency (%) |
| 1028323 | ||
| , | 525726 | |
| ( | 42047 | 2.4% |
| ) | 42032 | 2.4% |
| . | 34275 | 2.0% |
| ' | 26738 | 1.5% |
| - | 13876 | 0.8% |
| / | 10433 | 0.6% |
| # | 6037 | 0.3% |
| 1 | 5053 | 0.3% |
| Other values (50) | 15068 | 0.9% |
Greek
| Value | Count | Frequency (%) |
| α | 24 | 11.3% |
| ς | 19 | 9.0% |
| ο | 19 | 9.0% |
| ρ | 14 | 6.6% |
| σ | 9 | 4.2% |
| τ | 8 | 3.8% |
| η | 8 | 3.8% |
| ν | 8 | 3.8% |
| ά | 8 | 3.8% |
| λ | 8 | 3.8% |
| Other values (32) | 87 |
Arabic
| Value | Count | Frequency (%) |
| ا | 25 | |
| ل | 17 | |
| ي | 14 | 9.0% |
| ب | 12 | 7.7% |
| د | 9 | 5.8% |
| ر | 9 | 5.8% |
| و | 8 | 5.1% |
| س | 8 | 5.1% |
| ن | 7 | 4.5% |
| ش | 6 | 3.8% |
| Other values (17) | 41 |
Hebrew
| Value | Count | Frequency (%) |
| ל | 9 | |
| א | 7 | |
| ו | 7 | |
| ה | 5 | |
| י | 5 | |
| ר | 4 | 6.7% |
| ם | 4 | 6.7% |
| ט | 3 | 5.0% |
| ש | 3 | 5.0% |
| נ | 2 | 3.3% |
| Other values (9) | 11 |
Thai
| Value | Count | Frequency (%) |
| เ | 3 | |
| อ | 3 | |
| ก | 2 | 7.7% |
| า | 2 | 7.7% |
| น | 2 | 7.7% |
| ่ | 2 | 7.7% |
| ม | 2 | 7.7% |
| ี | 1 | 3.8% |
| แ | 1 | 3.8% |
| ค | 1 | 3.8% |
| Other values (7) | 7 |
Katakana
| Value | Count | Frequency (%) |
| ロ | 4 | |
| ペ | 4 | |
| ニ | 2 | |
| ト | 2 | |
| ッ | 2 | |
| ク | 2 | |
| ラ | 2 | |
| マ | 1 | 4.3% |
| ピ | 1 | 4.3% |
| ゴ | 1 | 4.3% |
| Other values (2) | 2 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 3 | |
| ̂ | 3 | |
| ̀ | 1 | 14.3% |
Hiragana
| Value | Count | Frequency (%) |
| り | 1 | |
| お | 1 | |
| ん | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7645701 | |
| None | 17870 | 0.2% |
| Cyrillic | 14096 | 0.2% |
| Hangul | 223 | < 0.1% |
| Punctuation | 191 | < 0.1% |
| Arabic | 156 | < 0.1% |
| CJK | 117 | < 0.1% |
| Hebrew | 60 | < 0.1% |
| Thai | 26 | < 0.1% |
| Katakana | 23 | < 0.1% |
| Other values (4) | 14 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1028323 | 13.4% | |
| e | 649946 | 8.5% |
| , | 525726 | 6.9% |
| a | 519515 | 6.8% |
| r | 472136 | 6.2% |
| i | 419713 | 5.5% |
| n | 404639 | 5.3% |
| o | 360973 | 4.7% |
| t | 288080 | 3.8% |
| l | 273828 | 3.6% |
| Other values (80) | 2702822 |
None
| Value | Count | Frequency (%) |
| é | 4956 | |
| è | 1621 | 9.1% |
| ä | 1166 | 6.5% |
| á | 1004 | 5.6% |
| í | 921 | 5.2% |
| ö | 822 | 4.6% |
| ô | 711 | 4.0% |
| ü | 700 | 3.9% |
| ó | 595 | 3.3% |
| ç | 511 | 2.9% |
| Other values (149) | 4863 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 1497 | 10.6% |
| о | 1125 | 8.0% |
| и | 1040 | 7.4% |
| е | 968 | 6.9% |
| н | 924 | 6.6% |
| р | 909 | 6.4% |
| т | 631 | 4.5% |
| к | 613 | 4.3% |
| л | 600 | 4.3% |
| в | 547 | 3.9% |
| Other values (55) | 5242 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 83 | |
| “ | 33 | 17.3% |
| – | 28 | 14.7% |
| „ | 23 | 12.0% |
| ” | 11 | 5.8% |
| ‘ | 7 | 3.7% |
| ‚ | 3 | 1.6% |
| | 2 | 1.0% |
| — | 1 | 0.5% |
Arabic
| Value | Count | Frequency (%) |
| ا | 25 | |
| ل | 17 | |
| ي | 14 | 9.0% |
| ب | 12 | 7.7% |
| د | 9 | 5.8% |
| ر | 9 | 5.8% |
| و | 8 | 5.1% |
| س | 8 | 5.1% |
| ن | 7 | 4.5% |
| ش | 6 | 3.8% |
| Other values (17) | 41 |
Hebrew
| Value | Count | Frequency (%) |
| ל | 9 | |
| א | 7 | |
| ו | 7 | |
| ה | 5 | |
| י | 5 | |
| ר | 4 | 6.7% |
| ם | 4 | 6.7% |
| ט | 3 | 5.0% |
| ש | 3 | 5.0% |
| נ | 2 | 3.3% |
| Other values (9) | 11 |
Hangul
| Value | Count | Frequency (%) |
| 진 | 7 | 3.1% |
| 영 | 6 | 2.7% |
| 최 | 6 | 2.7% |
| 동 | 5 | 2.2% |
| 유 | 5 | 2.2% |
| 이 | 5 | 2.2% |
| 은 | 4 | 1.8% |
| 정 | 4 | 1.8% |
| 희 | 4 | 1.8% |
| 사 | 4 | 1.8% |
| Other values (113) | 173 |
CJK
| Value | Count | Frequency (%) |
| 大 | 5 | 4.3% |
| 爸 | 4 | 3.4% |
| 雄 | 4 | 3.4% |
| 子 | 3 | 2.6% |
| 蕭 | 2 | 1.7% |
| 智 | 2 | 1.7% |
| 心 | 2 | 1.7% |
| 柏 | 2 | 1.7% |
| 毒 | 2 | 1.7% |
| 相 | 2 | 1.7% |
| Other values (77) | 89 |
Katakana
| Value | Count | Frequency (%) |
| ロ | 4 | |
| ペ | 4 | |
| ニ | 2 | |
| ト | 2 | |
| ッ | 2 | |
| ク | 2 | |
| ラ | 2 | |
| マ | 1 | 4.3% |
| ピ | 1 | 4.3% |
| ゴ | 1 | 4.3% |
| Other values (2) | 2 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 3 | |
| ̂ | 3 | |
| ̀ | 1 | 14.3% |
Thai
| Value | Count | Frequency (%) |
| เ | 3 | |
| อ | 3 | |
| ก | 2 | 7.7% |
| า | 2 | 7.7% |
| น | 2 | 7.7% |
| ่ | 2 | 7.7% |
| ม | 2 | 7.7% |
| ี | 1 | 3.8% |
| แ | 1 | 3.8% |
| ค | 1 | 3.8% |
| Other values (7) | 7 |
Hiragana
| Value | Count | Frequency (%) |
| り | 1 | |
| お | 1 | |
| ん | 1 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 1 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ầ | 1 | |
| ả | 1 | |
| ổ | 1 |
ActorName
Categorical
| Distinct | 42678 |
|---|---|
| Distinct (%) | 93.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.4 KiB |
| NoName | 2418 |
|---|---|
| Georges Méliès | 24 |
| Louis Theroux | 15 |
| Mel Blanc | 12 |
| Jimmy Carr | 9 |
| Other values (42673) |
Length
| Max length | 4551 |
|---|---|
| Median length | 1414 |
| Mean length | 187.78079 |
| Min length | 4 |
Characters and Unicode
| Total characters | 8539519 |
|---|---|
| Distinct characters | 395 |
| Distinct categories | 16 ? |
| Distinct scripts | 9 ? |
| Distinct blocks | 10 ? |
Unique
| Unique | 42472 ? |
|---|---|
| Unique (%) | 93.4% |
Sample
| 1st row | Tom Hanks, Tim Allen, Don Rickles, Jim Varney, Wallace Shawn, John Ratzenberger, Annie Potts, John Morris, Erik von Detten, Laurie Metcalf, R. Lee Ermey, Sarah Freeman, Penn Jillette |
|---|---|
| 2nd row | Robin Williams, Jonathan Hyde, Kirsten Dunst, Bradley Pierce, Bonnie Hunt, Bebe Neuwirth, David Alan Grier, Patricia Clarkson, Adam Hann-Byrd, Laura Bell Bundy, James Handy, Gillian Barber, Brandon Obray, Cyrus Thiedeke, Gary Joseph Thorup, Leonard Zola, Lloyd Berry, Malcolm Stewart, Annabel Kershaw, Darryl Henriques, Robyn Driscoll, Peter Bryant, Sarah Gilson, Florica Vlad, June Lion, Brenda Lockmuller |
| 3rd row | Walter Matthau, Jack Lemmon, Ann-Margret, Sophia Loren, Daryl Hannah, Burgess Meredith, Kevin Pollak |
| 4th row | Whitney Houston, Angela Bassett, Loretta Devine, Lela Rochon, Gregory Hines, Dennis Haysbert, Michael Beach, Mykelti Williamson, Lamont Johnson, Wesley Snipes |
| 5th row | Steve Martin, Diane Keaton, Martin Short, Kimberly Williams-Paisley, George Newbern, Kieran Culkin, BD Wong, Peter Michael Goetz, Kate McGregor-Stewart, Jane Adams, Eugene Levy, Lori Alan |
Common Values
| Value | Count | Frequency (%) |
| NoName | 2418 | 5.3% |
| Georges Méliès | 24 | 0.1% |
| Louis Theroux | 15 | < 0.1% |
| Mel Blanc | 12 | < 0.1% |
| Jimmy Carr | 9 | < 0.1% |
| Werner Herzog | 8 | < 0.1% |
| Louis C.K. | 8 | < 0.1% |
| George Carlin | 8 | < 0.1% |
| David Attenborough | 8 | < 0.1% |
| Trevor Noah | 6 | < 0.1% |
| Other values (42668) | 42960 |
Length
| Value | Count | Frequency (%) |
| john | 9809 | 0.8% |
| michael | 7464 | 0.6% |
| david | 6190 | 0.5% |
| robert | 5725 | 0.5% |
| james | 5693 | 0.5% |
| richard | 4446 | 0.4% |
| paul | 4320 | 0.4% |
| peter | 3903 | 0.3% |
| william | 3432 | 0.3% |
| george | 3416 | 0.3% |
| Other values (112949) | 1113662 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1122712 | 13.1% | |
| a | 707750 | 8.3% |
| e | 668087 | 7.8% |
| n | 524439 | 6.1% |
| , | 519745 | 6.1% |
| r | 497639 | 5.8% |
| i | 484270 | 5.7% |
| o | 426429 | 5.0% |
| l | 366664 | 4.3% |
| s | 256009 | 3.0% |
| Other values (385) | 2965775 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5663898 | |
| Uppercase Letter | 1195912 | 14.0% |
| Space Separator | 1122715 | 13.1% |
| Other Punctuation | 542058 | 6.3% |
| Dash Punctuation | 14112 | 0.2% |
| Other Letter | 543 | < 0.1% |
| Decimal Number | 94 | < 0.1% |
| Final Punctuation | 83 | < 0.1% |
| Initial Punctuation | 23 | < 0.1% |
| Open Punctuation | 23 | < 0.1% |
| Other values (6) | 58 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 707750 | |
| e | 668087 | |
| n | 524439 | |
| r | 497639 | 8.8% |
| i | 484270 | 8.6% |
| o | 426429 | 7.5% |
| l | 366664 | 6.5% |
| s | 256009 | 4.5% |
| t | 253361 | 4.5% |
| h | 198021 | 3.5% |
| Other values (138) | 1281229 |
Other Letter
| Value | Count | Frequency (%) |
| ا | 32 | 5.9% |
| م | 31 | 5.7% |
| ع | 19 | 3.5% |
| ی | 19 | 3.5% |
| ن | 18 | 3.3% |
| 松 | 17 | 3.1% |
| ر | 17 | 3.1% |
| د | 17 | 3.1% |
| ي | 16 | 2.9% |
| 美 | 12 | 2.2% |
| Other values (104) | 345 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 109410 | 9.1% |
| S | 92377 | 7.7% |
| C | 84052 | 7.0% |
| J | 83374 | 7.0% |
| B | 82422 | 6.9% |
| A | 70859 | 5.9% |
| R | 67418 | 5.6% |
| D | 65916 | 5.5% |
| L | 61183 | 5.1% |
| G | 54690 | 4.6% |
| Other values (81) | 424211 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 37 | |
| 0 | 29 | |
| 2 | 8 | 8.5% |
| 1 | 8 | 8.5% |
| 9 | 4 | 4.3% |
| 4 | 2 | 2.1% |
| 3 | 2 | 2.1% |
| 7 | 2 | 2.1% |
| 6 | 1 | 1.1% |
| 8 | 1 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 519745 | |
| . | 16060 | 3.0% |
| ' | 6097 | 1.1% |
| " | 129 | < 0.1% |
| · | 9 | < 0.1% |
| : | 6 | < 0.1% |
| & | 6 | < 0.1% |
| ! | 5 | < 0.1% |
| / | 1 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 10 | |
| ิ | 2 | 11.8% |
| ี | 1 | 5.9% |
| ่ | 1 | 5.9% |
| ึ | 1 | 5.9% |
| ์ | 1 | 5.9% |
| ั | 1 | 5.9% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 74 | |
| ” | 6 | 7.2% |
| » | 3 | 3.6% |
Space Separator
| Value | Count | Frequency (%) |
| 1122712 | ||
| 3 | < 0.1% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 20 | |
| « | 3 | 13.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| „ | 14 | |
| ( | 9 |
Format
| Value | Count | Frequency (%) |
| | 5 | |
| | 1 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14112 |
Control
| Value | Count | Frequency (%) |
| 21 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6856726 | |
| Common | 1679148 | 19.7% |
| Cyrillic | 3070 | < 0.1% |
| Han | 276 | < 0.1% |
| Arabic | 241 | < 0.1% |
| Thai | 27 | < 0.1% |
| Greek | 14 | < 0.1% |
| Inherited | 11 | < 0.1% |
| Hangul | 6 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 707750 | 10.3% |
| e | 668087 | 9.7% |
| n | 524439 | 7.6% |
| r | 497639 | 7.3% |
| i | 484270 | 7.1% |
| o | 426429 | 6.2% |
| l | 366664 | 5.3% |
| s | 256009 | 3.7% |
| t | 253361 | 3.7% |
| h | 198021 | 2.9% |
| Other values (163) | 2474057 |
Han
| Value | Count | Frequency (%) |
| 松 | 17 | 6.2% |
| 美 | 12 | 4.3% |
| 田 | 11 | 4.0% |
| 龙 | 11 | 4.0% |
| 平 | 11 | 4.0% |
| 长 | 11 | 4.0% |
| 泽 | 11 | 4.0% |
| 雅 | 11 | 4.0% |
| 森 | 9 | 3.3% |
| 杰 | 9 | 3.3% |
| Other values (55) | 163 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 323 | 10.5% |
| и | 315 | 10.3% |
| о | 233 | 7.6% |
| н | 229 | 7.5% |
| р | 215 | 7.0% |
| е | 174 | 5.7% |
| л | 155 | 5.0% |
| к | 136 | 4.4% |
| т | 115 | 3.7% |
| с | 109 | 3.6% |
| Other values (51) | 1066 |
Common
| Value | Count | Frequency (%) |
| 1122712 | ||
| , | 519745 | |
| . | 16060 | 1.0% |
| - | 14112 | 0.8% |
| ' | 6097 | 0.4% |
| " | 129 | < 0.1% |
| ’ | 74 | < 0.1% |
| 5 | 37 | < 0.1% |
| 0 | 29 | < 0.1% |
| 21 | < 0.1% | |
| Other values (24) | 132 | < 0.1% |
Arabic
| Value | Count | Frequency (%) |
| ا | 32 | |
| م | 31 | |
| ع | 19 | 7.9% |
| ی | 19 | 7.9% |
| ن | 18 | 7.5% |
| ر | 17 | 7.1% |
| د | 17 | 7.1% |
| ي | 16 | 6.6% |
| ل | 9 | 3.7% |
| ب | 8 | 3.3% |
| Other values (18) | 55 |
Thai
| Value | Count | Frequency (%) |
| ว | 2 | 7.4% |
| น | 2 | 7.4% |
| ง | 2 | 7.4% |
| ิ | 2 | 7.4% |
| ร | 2 | 7.4% |
| า | 2 | 7.4% |
| ี | 1 | 3.7% |
| ส | 1 | 3.7% |
| ด | 1 | 3.7% |
| ธ | 1 | 3.7% |
| Other values (11) | 11 |
Hangul
| Value | Count | Frequency (%) |
| 조 | 1 | |
| 병 | 1 | |
| 만 | 1 | |
| 강 | 1 | |
| 계 | 1 | |
| 열 | 1 |
Greek
| Value | Count | Frequency (%) |
| ν | 6 | |
| Ζ | 2 | 14.3% |
| α | 2 | 14.3% |
| ί | 2 | 14.3% |
| ο | 2 | 14.3% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 10 | |
| | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8497424 | |
| None | 38289 | 0.4% |
| Cyrillic | 3070 | < 0.1% |
| CJK | 276 | < 0.1% |
| Arabic | 241 | < 0.1% |
| Punctuation | 120 | < 0.1% |
| Latin Ext Additional | 56 | < 0.1% |
| Thai | 27 | < 0.1% |
| Diacriticals | 10 | < 0.1% |
| Hangul | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1122712 | 13.2% | |
| a | 707750 | 8.3% |
| e | 668087 | 7.9% |
| n | 524439 | 6.2% |
| , | 519745 | 6.1% |
| r | 497639 | 5.9% |
| i | 484270 | 5.7% |
| o | 426429 | 5.0% |
| l | 366664 | 4.3% |
| s | 256009 | 3.0% |
| Other values (66) | 2923680 |
None
| Value | Count | Frequency (%) |
| é | 9088 | |
| á | 4156 | 10.9% |
| í | 2756 | 7.2% |
| ô | 2332 | 6.1% |
| ö | 2025 | 5.3% |
| ó | 1882 | 4.9% |
| ü | 1495 | 3.9% |
| ć | 1360 | 3.6% |
| è | 1243 | 3.2% |
| ä | 996 | 2.6% |
| Other values (111) | 10956 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 323 | 10.5% |
| и | 315 | 10.3% |
| о | 233 | 7.6% |
| н | 229 | 7.5% |
| р | 215 | 7.0% |
| е | 174 | 5.7% |
| л | 155 | 5.0% |
| к | 136 | 4.4% |
| т | 115 | 3.7% |
| с | 109 | 3.6% |
| Other values (51) | 1066 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 74 | |
| “ | 20 | 16.7% |
| „ | 14 | 11.7% |
| ” | 6 | 5.0% |
| | 5 | 4.2% |
| | 1 | 0.8% |
Arabic
| Value | Count | Frequency (%) |
| ا | 32 | |
| م | 31 | |
| ع | 19 | 7.9% |
| ی | 19 | 7.9% |
| ن | 18 | 7.5% |
| ر | 17 | 7.1% |
| د | 17 | 7.1% |
| ي | 16 | 6.6% |
| ل | 9 | 3.7% |
| ب | 8 | 3.3% |
| Other values (18) | 55 |
CJK
| Value | Count | Frequency (%) |
| 松 | 17 | 6.2% |
| 美 | 12 | 4.3% |
| 田 | 11 | 4.0% |
| 龙 | 11 | 4.0% |
| 平 | 11 | 4.0% |
| 长 | 11 | 4.0% |
| 泽 | 11 | 4.0% |
| 雅 | 11 | 4.0% |
| 森 | 9 | 3.3% |
| 杰 | 9 | 3.3% |
| Other values (55) | 163 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ễ | 15 | |
| ạ | 9 | |
| ỳ | 6 | 10.7% |
| ị | 6 | 10.7% |
| ế | 5 | 8.9% |
| ả | 4 | 7.1% |
| ỗ | 4 | 7.1% |
| ề | 4 | 7.1% |
| ầ | 2 | 3.6% |
| ố | 1 | 1.8% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 10 |
Thai
| Value | Count | Frequency (%) |
| ว | 2 | 7.4% |
| น | 2 | 7.4% |
| ง | 2 | 7.4% |
| ิ | 2 | 7.4% |
| ร | 2 | 7.4% |
| า | 2 | 7.4% |
| ี | 1 | 3.7% |
| ส | 1 | 3.7% |
| ด | 1 | 3.7% |
| ธ | 1 | 3.7% |
| Other values (11) | 11 |
Hangul
| Value | Count | Frequency (%) |
| 조 | 1 | |
| 병 | 1 | |
| 만 | 1 | |
| 강 | 1 | |
| 계 | 1 | |
| 열 | 1 |
| Budget | Popularity | Revenue | Runtime | VoteAverage | VoteCount | ReleaseYear | ReleaseMonth | Return | Id | OriginalLanguage | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Budget | 1.000 | 0.463 | 0.644 | 0.227 | 0.072 | 0.484 | 0.141 | 0.047 | 0.775 | -0.186 | 0.000 |
| Popularity | 0.463 | 1.000 | 0.491 | 0.307 | 0.241 | 0.893 | 0.186 | 0.072 | 0.447 | -0.279 | 0.000 |
| Revenue | 0.644 | 0.491 | 1.000 | 0.254 | 0.127 | 0.513 | 0.104 | 0.048 | 0.853 | -0.218 | 0.000 |
| Runtime | 0.227 | 0.307 | 0.254 | 1.000 | 0.193 | 0.290 | 0.034 | 0.072 | 0.234 | -0.161 | 0.111 |
| VoteAverage | 0.072 | 0.241 | 0.127 | 0.193 | 1.000 | 0.318 | -0.009 | 0.048 | 0.120 | -0.120 | 0.070 |
| VoteCount | 0.484 | 0.893 | 0.513 | 0.290 | 0.318 | 1.000 | 0.197 | 0.063 | 0.474 | -0.283 | 0.000 |
| ReleaseYear | 0.141 | 0.186 | 0.104 | 0.034 | -0.009 | 0.197 | 1.000 | -0.014 | 0.087 | 0.221 | 0.145 |
| ReleaseMonth | 0.047 | 0.072 | 0.048 | 0.072 | 0.048 | 0.063 | -0.014 | 1.000 | 0.048 | -0.029 | 0.047 |
| Return | 0.775 | 0.447 | 0.853 | 0.234 | 0.120 | 0.474 | 0.087 | 0.048 | 1.000 | -0.200 | 0.000 |
| Id | -0.186 | -0.279 | -0.218 | -0.161 | -0.120 | -0.283 | 0.221 | -0.029 | -0.200 | 1.000 | 0.046 |
| OriginalLanguage | 0.000 | 0.000 | 0.000 | 0.111 | 0.070 | 0.000 | 0.145 | 0.047 | 0.000 | 0.046 | 1.000 |
| Budget | Genres | OriginalLanguage | Overview | Popularity | ProductionCompanies | ProductionCountries | ReleaseDate | Revenue | Runtime | Tagline | Title | VoteAverage | VoteCount | ReleaseYear | ReleaseMonth | Return | Director | Id | MovieCharacter | ActorName | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 30000000.0 | Animation, Comedy, Family | en | Led by Woody, Andy's toys live happily in his room until Andy's birthday brings Buzz Lightyear onto the scene. Afraid of losing his place in Andy's heart, Woody plots against Buzz. But when circumstances separate Buzz and Woody from their owner, the duo eventually learns to put aside their differences. | 21.946943 | Pixar Animation Studios | US | 1995-10-30 | 373554033.0 | 81.0 | NaN | Toy Story | 7.7 | 5415.0 | 1995.0 | 10.0 | 12.451801 | ['John Lasseter'] | 862 | Woody (voice), Buzz Lightyear (voice), Mr. Potato Head (voice), Slinky Dog (voice), Rex (voice), Hamm (voice), Bo Peep (voice), Andy (voice), Sid (voice), Mrs. Davis (voice), Sergeant (voice), Hannah (voice), TV Announcer (voice) | Tom Hanks, Tim Allen, Don Rickles, Jim Varney, Wallace Shawn, John Ratzenberger, Annie Potts, John Morris, Erik von Detten, Laurie Metcalf, R. Lee Ermey, Sarah Freeman, Penn Jillette |
| 1 | 65000000.0 | Adventure, Fantasy, Family | en | When siblings Judy and Peter discover an enchanted board game that opens the door to a magical world, they unwittingly invite Alan -- an adult who's been trapped inside the game for 26 years -- into their living room. Alan's only hope for freedom is to finish the game, which proves risky as all three find themselves running from giant rhinoceroses, evil monkeys and other terrifying creatures. | 17.015539 | TriStar Pictures, Teitler Film, Interscope Communications | US | 1995-12-15 | 262797249.0 | 104.0 | Roll the dice and unleash the excitement! | Jumanji | 6.9 | 2413.0 | 1995.0 | 12.0 | 4.043035 | ['Joe Johnston'] | 8844 | Alan Parrish, Samuel Alan Parrish / Van Pelt, Judy Sheperd, Peter Shepherd, Sarah Whittle, Nora Shepherd, Carl Bentley, Carol Anne Parrish, Alan Parrish (young), Sarah Whittle (young), Exterminator, Mrs. Thomas the Realtor, Benjamin, Caleb, Billy Jessup, Cop, Bum, Jim Shepherd, Martha Shepherd, Gun Salesman, Paramedic, Paramedic, Girl, Girl, Baker, Pianist | Robin Williams, Jonathan Hyde, Kirsten Dunst, Bradley Pierce, Bonnie Hunt, Bebe Neuwirth, David Alan Grier, Patricia Clarkson, Adam Hann-Byrd, Laura Bell Bundy, James Handy, Gillian Barber, Brandon Obray, Cyrus Thiedeke, Gary Joseph Thorup, Leonard Zola, Lloyd Berry, Malcolm Stewart, Annabel Kershaw, Darryl Henriques, Robyn Driscoll, Peter Bryant, Sarah Gilson, Florica Vlad, June Lion, Brenda Lockmuller |
| 2 | 0.0 | Romance, Comedy | en | A family wedding reignites the ancient feud between next-door neighbors and fishing buddies John and Max. Meanwhile, a sultry Italian divorcée opens a restaurant at the local bait shop, alarming the locals who worry she'll scare the fish away. But she's less interested in seafood than she is in cooking up a hot time with Max. | 11.712900 | Warner Bros., Lancaster Gate | US | 1995-12-22 | 0.0 | 101.0 | Still Yelling. Still Fighting. Still Ready for Love. | Grumpier Old Men | 6.5 | 92.0 | 1995.0 | 12.0 | 0.000000 | ['Howard Deutch'] | 15602 | Max Goldman, John Gustafson, Ariel Gustafson, Maria Sophia Coletta Ragetti, Melanie Gustafson, Grandpa Gustafson, Jacob Goldman | Walter Matthau, Jack Lemmon, Ann-Margret, Sophia Loren, Daryl Hannah, Burgess Meredith, Kevin Pollak |
| 3 | 16000000.0 | Comedy, Drama, Romance | en | Cheated on, mistreated and stepped on, the women are holding their breath, waiting for the elusive "good man" to break a string of less-than-stellar lovers. Friends and confidants Vannah, Bernie, Glo and Robin talk it all out, determined to find a better way to breathe. | 3.859495 | Twentieth Century Fox Film Corporation | US | 1995-12-22 | 81452156.0 | 127.0 | Friends are the people who let you be yourself... and never let you forget it. | Waiting to Exhale | 6.1 | 34.0 | 1995.0 | 12.0 | 5.090760 | ['Forest Whitaker'] | 31357 | Savannah 'Vannah' Jackson, Bernadine 'Bernie' Harris, Gloria 'Glo' Matthews, Robin Stokes, Marvin King, Kenneth Dawkins, John Harris, Sr., Troy, Joseph, James Wheeler | Whitney Houston, Angela Bassett, Loretta Devine, Lela Rochon, Gregory Hines, Dennis Haysbert, Michael Beach, Mykelti Williamson, Lamont Johnson, Wesley Snipes |
| 4 | 0.0 | Comedy | en | Just when George Banks has recovered from his daughter's wedding, he receives the news that she's pregnant ... and that George's wife, Nina, is expecting too. He was planning on selling their home, but that's a plan that -- like George -- will have to change with the arrival of both a grandchild and a kid of his own. | 8.387519 | Sandollar Productions, Touchstone Pictures | US | 1995-02-10 | 76578911.0 | 106.0 | Just When His World Is Back To Normal... He's In For The Surprise Of His Life! | Father of the Bride Part II | 5.7 | 173.0 | 1995.0 | 2.0 | 0.000000 | ['Charles Shyer'] | 11862 | George Banks, Nina Banks, Franck Eggelhoffer, Annie Banks-MacKenzie, Bryan MacKenzie, Matty Banks, Howard Weinstein, John MacKenzie, Joanna MacKenzie, Dr. Megan Eisenberg, Mr. Habib, Wife Mrs. Habib | Steve Martin, Diane Keaton, Martin Short, Kimberly Williams-Paisley, George Newbern, Kieran Culkin, BD Wong, Peter Michael Goetz, Kate McGregor-Stewart, Jane Adams, Eugene Levy, Lori Alan |
| 5 | 60000000.0 | Action, Crime, Drama, Thriller | en | Obsessive master thief, Neil McCauley leads a top-notch crew on various insane heists throughout Los Angeles while a mentally unstable detective, Vincent Hanna pursues him without rest. Each man recognizes and respects the ability and the dedication of the other even though they are aware their cat-and-mouse game may end in violence. | 17.924927 | Regency Enterprises, Forward Pass, Warner Bros. | US | 1995-12-15 | 187436818.0 | 170.0 | A Los Angeles Crime Saga | Heat | 7.7 | 1886.0 | 1995.0 | 12.0 | 3.123947 | ['Michael Mann'] | 949 | Lt. Vincent Hanna, Neil McCauley, Chris Shiherlis, Nate, Michael Cheritto, Justine Hanna, Eady, Charlene Shiherlis, Sergeant Drucker, Lauren Gustafson, Bosko, Kelso, Richard Torena, Alan Marciano, Detective Casals, Donald Breedan, Trejo, Hugh Benny, Roger Van Zant, Waingro, Elaine Cheritto, Schwartz, Albert Torena, Dr. Bob, Ralph, Anna Trejo, Armoured Guard, Hooker's Mother, Timmons, Shooter at Drive-in, Driver at Drive-in, Officer Bruce, Claudia, Bosko's Date, Sergeant Heinz, Rachel, Captain Jackson, Harry Dieter, Bank Guard, Armoured Truck Driver, Hostage Girl, 1st SIS Detective in the hallway (uncredited), Solenko, Restaurant Manager (uncredited), Castilian Woman (uncredited), Lillian, Construction Clerk, Children's Hospital Doctor, Dominick, Bartender, Casals' Date, Marcia Drucker, Armoured Guard, Basketball Player, Children's Hospital Nurse, Detective, Prostitute, Bar Couple (uncredited), Restaurant Patron (uncredited), Police Woman (uncredited), Grocery Store Employee (uncredited), Cusamano (uncredited), Grocery Store Cop (uncredited), Waitress (uncredited), Bank Guard (uncredited), Ellis (uncredited) | Al Pacino, Robert De Niro, Val Kilmer, Jon Voight, Tom Sizemore, Diane Venora, Amy Brenneman, Ashley Judd, Mykelti Williamson, Natalie Portman, Ted Levine, Tom Noonan, Tone Loc, Hank Azaria, Wes Studi, Dennis Haysbert, Danny Trejo, Henry Rollins, William Fichtner, Kevin Gage, Susan Traylor, Jerry Trimble, Ricky Harris, Jeremy Piven, Xander Berkeley, Begonya Plaza, Rick Avery, Hazelle Goodman, Ray Buktenica, Max Daniels, Vince Deadrick Jr., Steven Ford, Farrah Forke, Patricia Healy, Paul Herman, Cindy Katz, Brian Libby, Dan Martin, Mario Roberts, Thomas Rosales, Jr., Yvonne Zima, Mick Gould, Bud Cort, Viviane Vives, Kim Staunton, Martin Ferrero, Brad Baldridge, Andrew Camuccio, Kenny Endoso, Kimberly Flynn, Niki Harris, Bill McIntosh, Rick Marzan, Terry Miller, Daniel O'Haco, Kai Soremekun, Peter Blackwell, Trevor Coppola, Mary Kircher, Darin Mangan, Robert Miranda, Manny Perry, Iva Franks Singer, Tim Werner, Philip Ettington |
| 6 | 58000000.0 | Comedy, Romance | en | An ugly duckling having undergone a remarkable change, still harbors feelings for her crush: a carefree playboy, but not before his business-focused brother has something to say about it. | 6.677277 | Paramount Pictures, Scott Rudin Productions, Mirage Enterprises, Sandollar Productions, Constellation Entertainment, Worldwide, Mont Blanc Entertainment GmbH | DE, US | 1995-12-15 | 0.0 | 127.0 | You are cordially invited to the most surprising merger of the year. | Sabrina | 6.2 | 141.0 | 1995.0 | 12.0 | 0.000000 | ['Sydney Pollack'] | 11860 | Linus Larrabee, Sabrina Fairchild, David Larrabee, Mrs. Ingrid Tyson, Maude Larrabee, Fairchild, Patrick Tyson, Elizabeth Tyson, Mack, Irene, Louis, Scott, Rosa, Joanna, Martine, Linda, Ron, Nurse, Carol, Ticket Taker, Singer at Larrabee Party, Butler, Red Head, Bartender, Kelly, India, Make-Up Assistant, Assistant, Model, Model, Model, Model, Model, Model, Paris Friend, Paris Friend, Paris Friend, Paris Friend, Paris Friend, Paris Friend, Helicopter Pilot, Gulf Stream Pilot, Sheik, Tyson Butler, Mother in Hospital, Father in Hospital, Trainer, Secretary, Moroccan Waiter, Senator, Japanese Businessman (uncredited), Airport Employee (uncredited), Head Butler (uncredited), Businessman in Window (uncredited), Wedding Guest (uncredited), Pizza Patron (uncredited), Ballroom Dancer (uncredited) | Harrison Ford, Julia Ormond, Greg Kinnear, Angie Dickinson, Nancy Marchand, John Wood, Richard Crenna, Lauren Holly, Dana Ivey, Fanny Ardant, Patrick Bruel, Paul Giamatti, Miriam Colón, Elizabeth Franz, Valérie Lemercier, Becky Ann Baker, John C. Vennema, Margo Martindale, J. Smith-Cameron, Christine Luneau-Lipton, Michael Dees, Denis Holmes, Jo-Jo Lowe, Ira Wheeler, Philippa Cooper, Ayako Kawahara, François Genty, Guillaume Gallienne, Inés Sastre, Phina Oruche, Andrea Behalikova, Jennifer Herrera, Kristina Kumlin, Eva Linderholm, Carmen Chaplin, Micheline Van de Velde, Joanna Rhodes, Alan Boone, Patrick Forster-Delmas, Kentaro Matsuo, Peter McKernan, Ed Connelly, Ronald L. Schwary, Alvin Lum, Siching Song, Phil Nee, Randy Becker, Susan Browning, Anthony Mondal, Peter Parks, Woodrow Asai, Eric Bruno Borgman, Michael Cline, Christopher Del Gaudio, Philippe Hartmann, Jerry Quinn, Dori Rosenthal |
| 7 | 0.0 | Action, Adventure, Drama, Family | en | A mischievous young boy, Tom Sawyer, witnesses a murder by the deadly Injun Joe. Tom becomes friends with Huckleberry Finn, a boy with no future and no family. Tom has to choose between honoring a friendship or honoring an oath because the town alcoholic is accused of the murder. Tom and Huck go through several adventures trying to retrieve evidence. | 2.561161 | Walt Disney Pictures | US | 1995-12-22 | 0.0 | 97.0 | The Original Bad Boys. | Tom and Huck | 5.4 | 45.0 | 1995.0 | 12.0 | 0.000000 | ['Peter Hewitt'] | 45325 | Tom Sawyer, Huck Finn, Becky Thatcher, Muff Potter, Aunt Polly, Injun Joe, Townsperson | Jonathan Taylor Thomas, Brad Renfro, Rachael Leigh Cook, Michael McShane, Amy Wright, Eric Schweig, Tamara Mello |
| 8 | 35000000.0 | Action, Adventure, Thriller | en | International action superstar Jean Claude Van Damme teams with Powers Boothe in a Tension-packed, suspense thriller, set against the back-drop of a Stanley Cup game.Van Damme portrays a father whose daughter is suddenly taken during a championship hockey game. With the captors demanding a billion dollars by game's end, Van Damme frantically sets a plan in motion to rescue his daughter and abort an impending explosion before the final buzzer... | 5.231580 | Universal Pictures, Imperial Entertainment, Signature Entertainment | US | 1995-12-22 | 64350171.0 | 106.0 | Terror goes into overtime. | Sudden Death | 5.5 | 174.0 | 1995.0 | 12.0 | 1.838576 | ['Peter Hyams'] | 9091 | Darren Francis Thomas McCord, Joshua Foss, Matthew Hallmark, Vizepräsident Daniel Bender, Tyler, Emily McCord | Jean-Claude Van Damme, Powers Boothe, Dorian Harewood, Raymond J. Barry, Ross Malinger, Whittni Wright |
| 9 | 58000000.0 | Adventure, Action, Thriller | en | James Bond must unmask the mysterious head of the Janus Syndicate and prevent the leader from utilizing the GoldenEye weapons system to inflict devastating revenge on Britain. | 14.686036 | United Artists, Eon Productions | GB, US | 1995-11-16 | 352194034.0 | 130.0 | No limits. No fears. No substitutes. | GoldenEye | 6.6 | 1194.0 | 1995.0 | 11.0 | 6.072311 | ['Martin Campbell'] | 710 | James Bond, Alec Trevelyan, Natalya Fyodorovna Simonova, Xenia Onatopp, Jack Wade, M, General Arkady Grigorovich Ourumov, Valentin Dmitrovich Zukovsky, Boris Grishenko, Defense Minister Dmitri Mishkin, Q, Miss Moneypenny, Bill Tanner, Caroline, Severnaya Duty Officer, Admiral Chuck Farrell, Computer Store Manager, Irina, Anna, Mig Pilot | Pierce Brosnan, Sean Bean, Izabella Scorupco, Famke Janssen, Joe Don Baker, Judi Dench, Gottfried John, Robbie Coltrane, Alan Cumming, Tchéky Karyo, Desmond Llewelyn, Samantha Bond, Michael Kitchen, Serena Gordon, Simon Kunz, Billy J. Mitchell, Constantine Gregory, Minnie Driver, Michelle Arthur, Ravil Isyanov |
| Budget | Genres | OriginalLanguage | Overview | Popularity | ProductionCompanies | ProductionCountries | ReleaseDate | Revenue | Runtime | Tagline | Title | VoteAverage | VoteCount | ReleaseYear | ReleaseMonth | Return | Director | Id | MovieCharacter | ActorName | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45466 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Jean Yarbrough'] | 84419 | The Creeper, Steven Morrow, Joan Medford, Police Lt. Larry Brooks, Marcel De Lange, F. Holmes Harmon, Hal Ormiston, Lady of the Streets, Stella McNally, Mr. Samuels, Jerry | Rondo Hatton, Robert Lowery, Virginia Grey, Bill Goodwin, Martin Kosleck, Alan Napier, Howard Freeman, Virginia Christine, Joan Shawlee, Byron Foulger, Syd Saylor |
| 45467 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Ben Rock'] | 390959 | Debuty Hank Hart, Jeff Patterson, Kathy Patterson, Bill Barnes, Dr. Liam Woblick, Aidan James, Jeff Schoene, Dilva Henry, Vera Tenslue, News Reporter #2, John Huck, Kim Diamond, Miriam Lane, News Reporter #3, Frank Parsons, Dr. Clayton Larson, David Paulson, Bill Dixon, Donald McFerrell | Tony Abatemarco, Andre Brooks, Mariclare Costello, Bill Dreggors, Apollo Dukakis, Philip Friedman, James Gleason, Dilva Henry, Bari Hochwald, Wendy Hoffman, John Huck, Rachel Moskowitz, Sandy Mulvihill, Roger Nolan, Chris Parnell, Byrne Piven, Richard Sexton, Rich Williams, Ray Xifo |
| 45468 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Ben Rock'] | 289923 | Branwall, Sarah Didonna, Kyle Brody, Bill Barnes, Rustin Parr, Heather Donahue, Joshua Leonard, Michael C. Williams | Monty Bane, Lucy Butler, David Grammer, Bill Dreggors, Frank Pastor, Heather Donahue, Joshua Leonard, Michael C. Williams |
| 45469 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Aaron Osborne'] | 222848 | Kira (as Cassandra Leigh), Daly, Ruggs, Lewis, Billie, Dillon, Reitman, Ice, Announcer, Killa | Lisa Boyle, Kena Land, Zaneta Polard, Don Yanan, Debra K. Beatty, Mark Sikes, Robert J. Ferrelli, Ellyn Dawn Humphreys, Ron Jeremy, Ben Ramsey |
| 45470 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['John Irvin'] | 30840 | Sir Robert Hode, Maid Marian, Little John, Sir Miles Folcanet, Baron Roger Daguerre | Patrick Bergin, Uma Thurman, David Morrissey, Jürgen Prochnow, Jeroen Krabbé |
| 45471 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Hamid Nematollah'] | 439050 | , , | Leila Hatami, Kourosh Tahami, Elham Korda |
| 45472 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Lav Diaz'] | 111109 | Sister Angela, Homer, Crazy Woman/Virgin, Amang Tiburcio, Ex-convict/Dindo, Philosopher, Photographer, Ana/Call Center Woman, Filmmaker/Butcher, Poet of the Rain, Homer's mother | Angel Aquino, Perry Dizon, Hazel Orencio, Joel Torre, Bart Guingona, Soliman Cruz , Roeder, Angeli Bayani, Dante Perez, Betty Uy-Regala, Modesta |
| 45473 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Mark L. Lester'] | 67758 | Emily Shaw, Det. Mark Winston, Jayne Ferré, Alex Tyler, Tony, Frank Bianci, Detective Stan, Kerry Shaw, Peter Quinn, Boyd, Sammy Benetto, Steve, Fred, Artie, Hitman #1, Doorman | Erika Eleniak, Adam Baldwin, Julie du Page, James Remar, Damian Chapa, Louis Mandylor, Tom Wright, Jeremy Lelliott, James Quattrochi, Jason Widener, Joe Sabatino, Kiko Ellsworth, Don Swayze, Peter Dobson, Darrell Dubovsky |
| 45474 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Yakov Protazanov'] | 227506 | , , , , | Iwan Mosschuchin, Nathalie Lissenko, Pavel Pavlov, Aleksandr Chabrov, Vera Orlova |
| 45475 | NaN | NoGenre | NoLanguage | NoOverview | NaN | MissingValue | NoProductionCountries | NoReleaseDate | NaN | NaN | NaN | NoTitle | NaN | NaN | NaN | NaN | NaN | ['Daisy Asquith'] | 461257 | NoCharacter | NoName |